The Analysis of Feedback Systems 



Jan C. Willems 




Research Monograph No. 62 

THE M.I.T. PRESS 

Cambridge, Massachusetts, and London, England 



Contents 



Copyright © 1971 by 
The Massachusetts Institute of Technology 

Set in Monotype Times Roman 

Printed and bound in the United States of America by 

The Colonial Press Inc., Clinton, Massachusetts 

All rights reserved. No part of this book may be 

reproduced in any form or by any means, 

electronic or mechanical, including photocopying > 

recording, or by any information storage and 

retrieval system, without permission in writing from the publisher. 

ISBN 262 23046 1 {hardcover) 
Library of Congress catalog card number: 77-113730 



Preface 

Chapter 1 : Introduction 

1.1. Orientation 

1.2. Mathematical Preliminaries 

1.3. Transform Theory 

Chapter 2: Nonlinear Operators 

2.1. Introduction 

2.2. Operators-: Generalities 

2.3. Extended Spaces 

2.4. Causal Operators 

2.5. Algebras: Generalities 

2.6. Operator Algebras 

2.7. Contractions, Conic Operators, Positive Operators 

2.8. Conditions for Invertibility of Nonlinear Operators 

2.9. Conditions for Noninvertibility of Nonlinear Operators 

Chapter 3 : Positive Operators 

3.1. Introduction 

3.2. Positive Convolution Operators and Positive Memoryless 
Operators 

3.3. Memoryless Time-Invariant Nonlinear Operators 

3.4. Periodic Gain 

3.5. Positive Operators with Monotone or Odd-Monotone Non- 
linearities 



XI 



1 

2 
6 

10 

10 
11 
12 

14 
16 
19 
24 
28 
40 

45 
45 

49 
51 
53 

51 



viii CONTENTS 

3.6. Frequency-Power Relations for Nonlinear Resistors 

3.7. Factorization of Operators 

3.8. Positive Operators on Extended Spaces 

Chapter 4: Feedback Systems 

4.1. Introduction 

4.2. Mathematical Framework 

4.3. Well-Posedness of Feedback Systems 

4.4. Stability and Instability 

4.5. Continuity and Discontinuity 

4.6. Concluding Remarks 

Chapter 5: The Nyquist Criterion and the Circle Criterion 

5.1. Mathematical Description of the Feedback System 

5.2. Well-Posedness 

5.3. Stability and Instability in the Time-Invariant Case 

5.4. Stability and Instability in the Time- Varying Case 

Chapter 6: Stability Criteria Obtained Using Multipliers 



6.1. Introduction 

6.2. Transformations of the Feedback Loop 
A Stability Criterion for Linear Feedback Systems with a 
Periodic Gain in the Feedback Loop 

A Stability Criterion for Feedback Systems with a Monotone or 
an Odd-Monotone Nonlinearity in the Feedback Loop 



6.3. 



6.4. 



Chapter 7 : Linearization and Stability 

7.1. Linearization 

7.2. Linearization, Stability, and Continuity 

7.3. The Describing Function, the Total Gain Linearization, and the 
Incremental Gain Linearization 

7.4. Averaging Theory 

7.5. Counterexamples to Aizerman's Conjecture 

Index 



66 

72 
81 

86 

86 

87 

93 

101 

110 

114 

122 

122 
124 
125 
130 

136 

136 
138 

140 

155 

161 

161 
166 

169 

173 
177 

185 



Foreword 



This is the sixty-second volume in the M.LT. Research Monograph 
Series published by the M.I.T. Press. The objective of this series is to 
contribute to the professional literature a number of significant pieces 
of research, larger in scope than journal articles but normally less 
ambitious than finished books. We believe that such studies deserve a 
wider circulation than can be accomplished by informal channels, 
and we hope that this form of publication will make them readily 
accessible to research organizations, libraries, and independent 
workers. 

Howard W. Johnson 



Preface 

There are two ways, distinct in principle, of mathematically describing 
physical systems. The first one is called the "input-output description" 
since it relates external variables. The mathematical model then usually 
takes the form of an integral equation (the Green's function approach) 
or more generally, of an operator equation expressing the relationship 
between the inputs (the variables which can be manipulated) and the 
outputs (the variables of interest — typically the readings of a set of 
sensors). Such an input-output description can usually be obtained 
from some representative experiments. This approach requires minimal 
knowledge of the physical laws governing the system and of the inter- 
connections within the "black box." 

For the "internal description" of a physical system, on the contrary, 
one uses these physical laws and interconnections as the basis of the 
mathematical model. This generally takes the form of an ordinary 
differential equation or a partial differential equation. In the process 
leading to this model one works with a set of intermediate variables, 
related to the concept of state. There are thus two parts to mathematical 
models of internally described systems: a dynamical part — which 
describes the evolution of the state under the influence of the inputs, 
and a memoryless part — which relates the output to the state (and 
sometimes to the instantaneous value of the input as well). 

Which of the two descriptions is more convenient depends on the 
application and the purpose of the analysis or the synthesis that one has 
in mind. Modern system theory relies heavily on the state formulation 

xi 
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for synthesis techniques as exemplified by the highlights of modern 
control theory : Pontryagin's maximum principle, the regulator problem 
for linear systems, and the Kalman-Bucy filtering theory. 

In the analysis of control systems one usually investigates questions 
related to stability, continuity, and sensitivity of a closed-loop system. 
These questions can be treated from both an input-output or a state- 
space point of view, but it is only very recently that successful results 
have been obtained and that a sufficiently general framework has been 
developed to treat them in an input-output setting. The pioneering 
work in this development has been performed by I. W. Sandberg at Bell 
Laboratories and G. Zames at NASA-ERC. These authors formulated 
the stability question (the most important design constraint in feedback 
control) in an input-output setting. The idea of input-output stability 
finds its roots in the concept of bounded-input, bounded-output 
stability and in the work of Nyquist. Nyquist takes the finite integrability 
of the impulse response as the basic requirement for stability, whereas 
the concept of bounded-input, bounded-output stability requires that 
bounded inputs produce bounded outputs. The idea of Nyquist gives 
an excellent type of stability but unfortunately applies only to a very 
restricted class of systems, the linear time-invariant systems. The 
concept of bounded-input, bounded-output stability never has had 
much success, and very few specific results have been based on it. 
Moreover, it has been no simple matter to analyze feedback systems in 
this context, in which they are described by implicit equations. The key 
in the generalization of these methods to feedback systems has been the 
introduction of extended spaces. This will be emphasized in the 
subsequent chapters of this monograph. 

This monograph is an attempt to develop further and refine methods 
based on input-output descriptions for analyzing feedback systems. 
Contrary to previous work in this area, the treatment heavily emphasizes 
and exploits the causality of the operators involved. This brings the 
work into closer contact with the theory of dynamical systems and 
automata. (In fact, it can be argued that the very definitions of stability 
and extended spaces are ill-conceived unless the operators involved are 
explicitly assumed to be causal.) 

The monograph is built around Chapter 4, where the relevant 
concepts of weli-posedness, stability, continuity, and sensitivity are 
introduced. The mathematical foundations for this study will be found 
in Chapters 2 and 3. In Chapter 2 nonlinear operators are introduced 
and general conditions for the invertibility or the noninvertibility of 
nonlinear operators are derived. These conditions rely heavily on the 
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theory of Banach algebras and exploit causality considerations in great 
detail Chapter 3 is for the most part devoted to the establishment of a 
series of inequalities, an unpleasant task that mathematicians usually 
leave to applied mathematicians and engineers. Inequalities are the 
workhorses of analysis — and this monograph is no exception to the rule, 
since these inequalities are essential ingredients for the specific stability 
and instability criteria described in Chapters 5 and 6. The use of 
linearization techniques in stability theory is then discussed in Chapter 7. 

The monograph is intended primarily for researchers in system theory. 
The author hopes that it will also be enjoyed by control engineers who 
are eager to find a unified modern treatment of the analysis of feedback 
systems, and by mathematicians who appreciate the application of 
relatively advanced mathematical techniques to engineering questions. 

Parts of this monograph appeared in the author's doctoral disserta- 
tion entitled "Nonlinear Harmonic Analysis" and submitted in June 
1968 to the Department of Electrical Engineering of the Massachusetts 
Institute of Technology, Cambridge, Massachusetts. 



November 1969 



Jan Willems 
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1 Introduction 



1.1 Orientation 



A brief outline by chapters of the content of this monograph seems 
appropriate in order to focus attention on the subject matter, the 
original results, and the framework of the presentation. The first 
chapter is introductory; it deals mainly with mathematical preliminaries 
of a general nature. 

The second chapter is devoted to the study of operators and specifically 
discusses questions related to the invertibility of nonlinear operators. 
This study is made in an algebraic framework, and special emphasis is 
placed on the properties of causal (nonanticipatory) operators. Causal 
operators are indeed of particular interest to engineers and physicists. 
The concept of causality is roughly equivalent to that of a "dynamical 
system" and is a basic restriction of physical realizable systems. In the 
algebraic framework employed in this monograph, causal operators 
are considered as a subalgebra in the algebra of (in general, nonlinear) 
operators. Another heavily emphasized and exploited concept is that 
of extended spaces. These consist of functions which are well-behaved 
on bounded intervals, but which do not satisfy any regularity conditions 
at infinity. Extended spaces have not been used extensively in analysis; 
however, they are the natural setting for the study of causal operators, 
and they form a very elegant conceptual framework for the study of 
dynamical systems described, for instance, by an ordinary differential 
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equation with specified initial conditions or by Volterra integral 
equations. 

The analysis in the third chapter is devoted to the derivation of 
some specific positive operators, which yield the inequalities leading 
to some general frequency-power formulas and stability conditions. 

The basic concepts related to feedback systems are introduced in the 
fourth chapter. Only the analysis problem is considered, and the main 
questions investigated are well-posedness s stability, and continuity. 
This theory is developed in the framework of input-output descriptions 
of systems, and thus— following the modern trend of mathematical 
system theory — departs somewhat from the classical methods, which 
consider undriven systems with initial disturbances. 

The fifth chapter discusses the Nyquist criterion and the circle 
criterion. These yield graphical conditions for stability and instability 
of linear (possibly time-varying) systems in terms of frequency-response 

data. 

The sixth chapter is devoted to the study of some more complex 
stability criteria, which apply to systems with a linear time-invariant 
system in the forward loop and a periodically time-varying gain or a 
monotone memoryless nonlinearity in the feedback loop. 

The final chapter discusses linearization techniques and shows that 
properly defined linearizations can indeed be successfully used for the 
analysis of the continuity of feedback systems. This linearization, 
however, is of a dynamical type and leads to time-varying systems even 
when the original system is time invariant. The final chapter also 
contains a simple and rather general class of counterexamples to 
Aizerman's conjecture. 



1.2 Mathematical Preliminaries 

This section introduces some notation and definitions which will 
be freely used throughout this monograph. More details may be found, 
for instance, in Refs. 1, 2. 

A set (or space) is a collection of objects with a common property. 
The set, S, of objects with property P is denoted by S A {x \ x has 
property P}. A subset S x of a set S, denoted S x <= S, is defined as 
S x = {x | x e S and x has property P x } and is sometimes denoted by 
S x = {x e S | x has property PJ. The sets R, R+, 7, and 7+ denote 
respectively the real numbers, the nonnegative real numbers, the 
integers, and the nonnegative integers. 
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The union of the sets S x and S 2 , denoted by S x U S a , is defined as 
S x u S 2 A {x | x e S x orxe SJ. The intersection of the sets S x and 
S 2 , denoted by S x n S 2 , is defined as Si n S a A {x \ x e S x and 
x e SJ. The Cartesian product of two sets, denoted by S x X S 2 , is 
defined as S x x S 2 A {(x x ,x 2 ) \ x x e S l9 x 2 e S 2 }. 

A map F (or operator or function), from a set S x into a set S 2 is a law 
which associates with every element x e S x an element Fx e S 2 . S x is 
called the domain of the operator. If S' x <= Sj (S;' <= S[) and if F' and 
F" are maps from S[ into S 2 and S x into S 2 such that F'x = F"x for all 
xeS[{x£ SJ), then F' is called the restriction (an extension) of F" to 
Si (Si'). A sequence is a map from 7 (7+) to a set S and will be denoted by 
{x n },*eJ(«e/+). 

A metric space is a set X and a map, d, from Ixl into R + such that 
for all x, y, z e X, the following relations hold: d(x,y) = d(y,x) > 0; 
d(x,y) + d(y,z) > d(x,z) (the triangle inequality); and d(x,y) = if 
and only if x = y. 

A sequence {x n }, n e7+, of elements of a metric space X is said to 
converge to a point x e X if lim^^ rf(x„,x) = 0. This /iiwi7 ]?<?/«/ x is 
denoted by lim^^ x n . 

A subset, S, of a metric space, X, is said to be open if for any xeX 
there exists an e > such that the set N£x) A {7 e X \ d(x,y) < *} is a 
subset of S. A subset, S, of a metric space, X, is said to be closed if any 
converging sequence {x n }, n e /+, of points in S converges to a point 
in S. A sequence {*„}, n e 7 + , of elements of a metric space S is said to 
be a Cauchy (or fundamental) sequence if given any e > there exists 
an integer iV such that d(x n ,x m ) < e for all n, m > AT. A metric space 
is said to be complete if every Cauchy sequence converges/Completeness 
is one of the most important properties of metric spaces. A subset of a 
metric space is said to be compact if every bounded sequence has a 
convergent subsequence. 

A vector space (sometimes called a linear space or a linear vector 
space) is a set V and two maps, one called addition, denoted by +, 
from V x V into V 9 and the other called multiplication from the 
Cartesian product of the field of scalars K (which will throughout be 
taken to be the real or complex number system) and Finto V such that 
for all x, y, z e V and a, /? e K: 

1. (x+y) + * = .* + (y + z); 

2. there exists a zero element, denoted by 0, with* + = + ;c — *; 

3. there exists a negative element, denoted by — x, with x + (— x) — 
(j, + (— x) will be denoted by y — x) ; 
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4. x + y = y + z; 

5. (a + fix = olx + fix; 

6. a(x + 7) = ax + ay; 

7. (a^ = <x(Px) ^oipx; 

8. 1 • * = x 

A vector space is called a real or complex vector space according to 
whether the field K is the real or complex number system. R n denotes 
the real vector space formed by the /z-tuples of real numbers with 
addition and multiplication defined in the obvious way. A vector space 
V is called a normed vector space if a map (the norm), denoted by 
|| || , from V into jR+ is defined on it, such that: 

1. ||x|| =Oifandonlyifx = 0; 

2. HocjcH = |oc| 

3. II x + y\ < \x\ + \\y\\ (the triangle inequality). 

The norm induces a natural metric d{x,y) A \\x — y\\, and all statements 
(e.g. concerning convergence) always refer to this metric, unless 
otherwise mentioned. Sometimes the norm is subscripted for emphasis, 
as || || F , but the subscript will be deleted whenever there is no danger of 
confusion. 

A Banach space is a complete normed vector space. This completeness 
is, of course, to be understood in the topology induced by the natural 
metric. A very useful class of Banach spaces are the so-called L^spaces. 
These consist of Banach space 2?-valued functions defined on a 
measurable set S <= R, for which the pth power of the norm is 
integrable 1 in the case 1 < p < oo, with the norm defined by 



\\x\\ L?{S) A^jjx(t)\\%dty 



The space L%(S) is defined as the collection of all measurable jB-valued 
functions defined on a measurable set S <= R which are essentially 
bounded (i.e., there exists a real number M < go such that ||x(0IIb < M 
for almost all teS) with 

IMIzl«y> — ( inf M I IWOIIb < M almost everywhere on S}. 

1 The integration and raeasurability considerations refer to Lebesgue measure and 
integration when the Banach space B is finite-dimensional. Otherwise, these notions 
are to be interpreted in the sense of Bochner (see, e.g., Ref. 1, p. 78). 
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The sequence spaces If are defined in an analogous way, with the 
integral replaced by a summation. When B is taken to be the real or 
complex numbers and S is the interval (— oo, +<x>) for Z^-spaces or 
S = I for /^-spaces, then Lf (5) and If (S) will be denoted by L v and l 9 
respectively, when no confusion can occur. The Z^-spaces are very 
often used in analysis. The triangle inequality for Z^-spaces is known 
as Minkowski's inequality. Another useful inequality for Z^-spaces is 
Holder's inequality, which states that for fe L P (S) and g e L q (S) with 
VP + 1/? == 1 and 1 <t? ? ? < oo,/g eL x (5), and 

WfgWuis) < l!/ll£,(fif> teWLtisn- 

An inner product space is a linear vector space, V, with a map, 
denoted by ( ) and called the inner product, from V x Kinto the scalars 
K such that for all x,y,zeV and scalars a, j8, the following relations 
hold: 

\ t ( x ,y) = (y,x) (the overbar denotes complex conjugation); 

2. (out + Py 9 z) = *{x,z) + P(y,z); and 

3. (x,x) > and (x,x) = if and only if x = 0. 

The inner product induces a natural norm \\x\\ = \f(x,x) and all 
statements (e.g., concerning convergence) always refer to the metric 
induced by this norm, unless otherwise mentioned. Sometimes the 
inner product is denoted by (,) v , but the subscript will be deleted 
whenever there is no danger of confusion. 
The Cauchy-Schwartz inequality states that \(x,y)\ < ||*|| \\y\\- 
A Hilbert space is a complete inner product space. This completeness 
is, of course, to be understood in the topology induced by the natural 
norm. The standard example of a Hilbert space is R n with (x,y) R n = 
2lLi xm where x = (x l9 x % , . . . , x n ) and y = (y l9 y S9 . . . , jO- A very 
useful class of Hilbert spaces are the L 2 -spaces. These consist of 
Hilbert space iZ-valued functions defined on a measurable set S <= R, 
for which the square of the norm is integrable, and with 

<x>y)zf<8) Aj s (x(t),y(t)) H dt. 
Similarly, If (S) with S c Zis a Hilbert space, with 



(x,y)i*t8) A 2 <**0Off • 

neS 
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1.3 Transform Theory 

Definitions: If x e L l9 then the function X defined by 



X(ja>) 



J +00 
XI 
-00 



(t)e~ itot dt 



is called the Fourier transform of x. Clearly XeL m and \X\ Lw < 
11*11 z^ if *(f) is real, then X(jcd) — X(—jco). Since this transform need 
not belong to L x , it is in general impossible to define the inverse Fourier 
transform. However, if X itself turns out to belong to L x then 



*(0 



Z.TT J— oo 



X(ja))e jait dco 



(As always, this equality is to be taken in the L ± sense, that is, except 
on a set of zero Lesbesgue measure,) Thus the need of a slightly more 
general transform in which the inverse transform can always be defined 
is apparent. This is done by the limit-in-the-mean transform. It is well 
known that if x, y e L % n L x then X, YeL z and (x,y) = (X, Y)\2it 
{ParsevaVs equality). Let xeL 2 . Since L x n L z is dense in L 2> i.e., 
since any Z, 2 -function can be approximated arbitrarily closely {in the L % 
sense) by a function in L x n L 2 , there exists a sequence of functions 
{x n } in L 2 n jL x which is a Cauchy sequence and which converges to x 
(in the L % sense). Let X n be the Fourier transform of x n . It follows from 
the Parseval relation that i|x n — xj| = ^Tr)- 1 ' 2 !^ - X m \\ and that 
X n e L 2 . Thus since L 2 is complete, these transforms, X n9 converge to an 
element X of L 2 . This element X is called the limit-in-the-mean transform 
of x. It follows that the limit-in-the-mean-transform maps L 2 into itself 
and that (x,y) = {X, Y)\2tt for all xjel 2 and their limit-in-the-mean 
transforms X, F. 

One way of defining a limit-in-the-mean transform is by 



X(» = lim ( T x(t)e- ja,t dt 

T-*coJ-T 



where the limit is to be taken in the L % sense. (It is easily verified that 
this induces a particular choice for the Cauchy sequence {x n },) The 
notation that will be used for the limit-in-the-mean transform is 



X(ja>) = 1 



a.m. xi 

J— 00 



(t)e- iiat dt. 
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With this definition of transforms, the inversion is always possible, and 
the inverse transform formula states that 

x(0 = l.i.m. — f +a 10'a))^ dm. 

2*77"*/— oo 

Let x eL 2 (0 9 T), T > 0. Then the sequence X = {x k } 9 Ice I, given by 

x k = ±rx(t)e-* 2 « t/T dt 9 
Tjo 

is well defined since L 2 (0,T) <= 1^(0, T), and is called the Fourier series 
of x(t). Clearly X e /«, and x k — x_ k whenever x{t) is real. The Parseval 
relation states that if x l9 x 2 e L 2 (0,T) and if X l9 X % are their Fourier 
series, then X l9 X % e / a and <x ls x 2 > i2(0 T) = 27r(X 1 ,Z 2 > v 

In trying to obtain the inverse Fourier series formula, the same 
difficulties are encountered as with the inverse Fourier transform, and 
the same type of solution is presented. This leads to 



+ 00 



x(0 = l.i.m. 2 x/"*'. 

k=— oo 

One way of expressing this l.i.m. summation is by 

x(t) = lim f x k e m « tlT 9 

where the limit is to be taken in the L 2 (0,T) sense. 
If x e l ls then the function Z defined by 

+co - - - ■- 

x(z) - 2 *** 

fc=— 00 

exists for all \z\ = 1 and is called the z-transform of x. In trying to 
extend this notion to sequences in / 2 the same difficulties and the same 
solution as in the previous cases present themselves. This leads to the 
limit-in-the mean z-transform 



X(z) = l.i.m. 2 V* 

ft=— 00 

and the inverse z-transform 

X(z)z' x dz 



27T*/|«1=1 
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where the integral is interpreted in the usual manner since 

L 2 (|z| - 1) <= L ± (\z\ - 1). 

A continuous function, x, from R into K is said to be almost periodic 
if for every e > there exists a real number / such that every interval 
of the real line of length / contains at least one number r such that 

1*0 + r) - x(t)\ < e for all L 

Some properties of almost periodic functions are: 

1 . Every almost periodic function is bounded and uniformly continuous. 

2. Continuous periodic functions are almost periodic. 

3. The sums, products, and limits of uniformly convergent almost 
periodic functions are almost periodic. 

4. The limit, as T-* oo, of the mean value 



JL P 
2TJ~ 



x(t + r) dt 



exists, is independent of t for all almost periodic functions x 9 and 
converges uniformly in r. 
5. If x 1 and x z are almost periodic functions then so is 



X-i H* X« 



1 C T 

A lim — X x (t — r)x 2 (r) dr. 

T-*oo 2T J-T 



Moreover, x ± * x 2 ~ x% * *i and x x * {x % * x 3 ) = (* x * ;c 2 ) * x z for 
all almost periodic functions x u x %y * 3 . 

6. The function 

1 f T 
lim — x(t)e' imt dt 

T-co 2T J-T 

vanishes for all but a countable number of values of eo. 

7. The space of almost periodic functions forms an inner product 
space with 

1 C T 
(x l9 x 2 ) = hm — I x 1 (t)x^t) dt 

T-+ao LI J-T 

for x l9 x % almost periodic functions. (This inner product space is, 
however, not complete and not separable.) Let x be an almost 
periodic function and let {co k } be the set of values for which the limit 
in (6) does not vanish and let x k be the value of that limit for to = co k . 
The sequence {x k } is called the generalized Fourier series of x(t). If 
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x(t) is real, then o> belongs to the set {co k } if and only if -co does 
and the values x k associated with a> and -co are complex conjugates. 
The inverse Fourier series is defined as 



N 



j&kt 



x(t) = lim 2 x k e 

This limit, which exists, is to be taken in the metric induced by the 
inner product on the space of almost periodic functions. 2 

2 For more details on transform theory, see Refs. 3, 4. 
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2 Nonlinear Operators 



2.1 Introduction 

Many of the function spaces (e.g., Z^-spaces) encountered in applied 
mathematics carry the time (or, more generally, a real variable) as an 
essential parameter. This is particularly true in system theory where 
the purpose is to study the response of physical systems subject to 
inputs or initial conditions. Physical systems are furthermore non- 
anticipatory with respect to the parameter "time" in the sense that past 
and present values of the outputs do not depend on future values of 
the inputs. This property leads to the fact that physical systems may be 
described by a particular class of operators. These operators are 
referred to as causal operators: 1 they describe a nonanticipatory 
dependence of outputs on inputs with the range and the domain of the 
operator parametrized by a real variable, which plays the role of the 
time. 

This causality of operators is very often implicitly used in analysis, 
but a systematic exploitation of this structure is not often carried out 
explicitly. This chapter contains a systematic study of extended spaces 
which are believed to be the natural setting for function spaces param- 
etrized by the time, and of causal operators which describe a non- 
anticipatory input-output behavior. 

Mathematics evolves around equations and their solution. The study 

1 Causality is the fundamental property of physically realizable systems. Causality 
is essentially equivalent to the existence of a state (Ref. 1) and is thus the basic 
property of dynamical systems. For a study of causality from an abstract point of 
view, see Refs. 2, 3. 

10 
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of a nonlinear equation usually concentrates on the existence and 
uniqueness of solutions and on algorithms for the computation of the 
solution. This chapter contains a number of theorems concerning the 
solvability of nonlinear operator equations and describes appropriate 
algorithms for their solution. These involve conditions for the 
invertibility of operators in terms of contractions, conicity, and 
positivity considerations. Some of these conditions are standard. 
Others, which exploit the causality of the operators in an essential way, 
have, to the author's knowledge, not appeared in the mathematical 
literature before. All of the theorems which follow have direct applica- 
tions in the study of feedback systems and are introduced with these 
applications in mind. 



2.2 Operators: Generalities 

This section contains a number of general notions concerning 
nonlinear operators. Most of these notions are standard and can be 
found in the usual texts on functional analysis. 

A mapping from a space X into a space Y is called an operator from 
X into Y. Thus an operator associates with every element x e X a 
unique element y A Fx e Y. X is called the domain of F, denoted by 
Do OF), and Ra (F) A {y e Y \y = Fx, xeX} <= Y is called the 
range of F. 

An operator F from a metric space X into a metrix space Y is said to 
be continuous if every convergent sequence {xj, n e /+, yields a 
convergent sequence {Fx n }. It is said to be Lipschitz continuous if 

d(Fx,Fy) ^ 
sup — — —^ < go. 
*,isi d(x,y) 

This supremum will be called the Lipschitz constant of F on X. An 
operator F from a normed vector space X into a normed vector space 
Y is said to be bounded 2 if F0 = and if 

11**11 ^ 

sup inr < co - 

«or ||x|| 

This supremum will be called the bound of F. 

2 It appears to be no simple matter to define boundedness of a nonlinear operator 
in a satisfactory way. "Bounded sets into bounded sets'* appears to be the most 
logical—but mathematically least manageable— one. The norm relation ||Fx|| < 
M x + M 8 1| x || has been proposed and, although it has some advantages over the one 
adopted here, it is equally a compromise derived from the definition of bounded linear 
operators. The concept of boundedness adopted here is that of "finite gain." Need- 
less to say, ail of the above definitions become equivalent for linear operators. 
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The operator / from X into itself defined by Ix A x is called the 
identity operator. The operator O from a vector space X into a vector 
space Y defined by Ox A is called the zero operator. 

An operator F from X into F is said to be invertible on X if it is 
one-to-one on X and onto Y. There then exists an operator, denoted 
by F~\ from Y into X such that F~ X F = / and FF" 1 = /. 

Let F x and F % be operators from X into itself. The operators F x and 
F 2 are said to commute on X if F XJ F 2 — F 2 F X on X 

Let F l9 F % be operators from X into the vector space Zand let a be a 
scalar. Then F x + F 2 and aF x denote the operators defined for x e X 
by (F x + F 2 )x A. F x x + F 2 x and (aFi)x A a^x), respectively. 

Let F x and F a be operators from Xinto Y and Y into Z, respectively; 
then F Z F X denotes the operator from X into Z defined by (F % F x )x A 
^aC^i*) anc * * s called the composition of F x and F 3 . 

Let X and Y be vector spaces over the same field of scalars. An 
operator L from X into Y is said to be linear if for all x l9 x 2 e X and 
scalars a, /?, the relation L(ooc t + /tea) = clLx x + /?Ljc 2 is satisfied. 

Let L be a linear operator from the normed space Xinto the normed 
space Y. Then boundedness, continuity, and Lipschitz continuity are 
equivalent. Moreover, the bound of L equals 
sup ||Lx||. 

aseX 
11*11=1 

Let L be an invertible bounded linear operator from a Banach space X 
onto a Banach space Y. Then Lr 1 is also an invertible bounded linear 
operator. In fact, linearity of Lr 1 is immediate and boundedness of Lr 1 
follows from the closed graph theorem (Ref. 4, p. 47). 

Let L be a linear operator from an inner product space X into an 
inner product space Y. If there exists an operator, L*, from Kinto X 
such that for all x e X and y e Y, the relation (y,Lx) Y = (L*y,x)x 
holds, then L* is called the adjoint of L. The adjoint L* is linear and 
uniquely defined whenever it exists, and (L*)* = L (thus (L*)* exists 
if X* does). By the Riesz representation theorem (Ref. 4, p. 43), L* 
exists when L is bounded and X and 7 are Hilbert spaces. Moreover, 
the bound of L* equals that of L, and L is invertible if and only if 
L* is and (L -1 )* — (L*)™ 1 . An operator is called self adjoint if L* — L. 

2.3 Extended Spaces 

In this section the notion of extended spaces is introduced. These 
spaces play an important role in system theory and are very appropriate 
for the study of causal operators. 
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Let S = [r ,ao) or (—00,4- oo). 3 The set S will be referred to as the 
time-interval of definition. Let B be a Banach space, and let Y(B) denote 
the linear space of B- valued functions defined on S, that is, Y(B) ^ 
{x\x:S-+B}. 

Definitions; Let TeS. Then P T denotes the projection operator on 
Y{B) defined for x e Y(B) by 



(P T x)(t) A 



x(t) for t < T, t e S 
otherwise 



P T will be called the truncation operator and P T x will be called the 
truncation of x at time T {P T }, TeS, consists thus of a family of pro- 
jection operators on Y(B). 

Let W c Y(B) be a Banach space. The extended space W e is defined 
as W 6 A{xe Y(B) \ P T x e Wfor all TeS, T finite}. 

The following assumptions 4 are made about the space W: 
W.L The space W is closed under the family of projections {P T } 9 

TeS. 
W.2. For any xe W, the norm \\P T x\\ is a monotone nondecreasing 

function of T which satisfies 



lim ||i» r jc|| =0 and lim \\P T x\\ = |I*||. 

The family of projection operators {P T }> for Te S, is thus assumed 

to be a resolution of the identity. 
W.3. If x e W e then * e W if and only if sup reS ||Pyx|| < 00. 
W.4. For any real numbers t l9 t 2 e S, t x < / a , the vector space 

*F (W2) A {x g FF| *(*) = for / £ fc./J} 

is a closed subspace of W and thus itself a Banach space under the 
norm of W> 

3 The particular choice of S adopted here does exclude discrete systems, but the 
adjustments to treat this case are minor and are left to the reader. In fact, S could be 
taken as an arbitrary subset of the real line. Causality can be defined in terms of 
arbitrary locally compact Abelian groups. This viewpoint, which follows the 
development of modern mechanics, is proposed in References 5 and 6. 

* Not all of these assumptions are necessary for the theorems which follow. 
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Important spaces which satisfy these assumptions are L B (S) with 

l</><00. 5 

The extended space W e is in general a proper extension of W. For 
instance, Lf e (T Q ,co) is the collection of j?-valued measurable functions 
on [T , oo) whose norm is integrable on compact subsets of [T , oo). Note 
that W e is a linear space but, very importantly, that it is not normed. 
Notice also that an element x e W 6 belongs to Wit and only if the set 
M ^ {a e R | a = ||iVc||, Te S} is bounded in R. 



2.4 Causal Operators 

In this section the notion of a causal operator is formally introduced. 
It plays a central role in the following chapters and is a fundamental 
property of physically realizable systems. 

Definitions: An operator F from W'mto itself is said to be causal on W 
if P T F commutes 6 withP^ on JTfor all T e S. It is said to be anticausal 
on W if (/ — P T )F commutes with (/ — P T ) on W for all Te S. 
The operator F is said to be memoryless on Wif F is causal anc? anti- 
causal on W. It is easily seen that a memoryless operator jPis necessarily 
defined, for some map/from B x S into B, by the relation (Fx)(t) A 
f(x(t),t). Fis said to be strongly causal 7 on Wif Fis causal on Wand 
if for all TeS there exists, for any € > and T" e S, T < T, areal 
number AT > such that for any x, y eW with P T ,x = P T >y, the 
relation IIP^^F* - Fy)\\ < e ||P r+Aa ,(x - j;)|| is satisfied. 

Causality is a fundamental property of physically realizable systems. 
It merely expresses that past and present output values do not depend on 
future input values. Furthermore, many systems encountered in practice 
contain an integration or a delay in which case they become strongly 
causal. Strongly causal systems cannot react instantaneously to inputs. 

The definition of causal and strongly causal operators on W is 
completely analogous to these notions on W. A causal operator Ffrom 
W e into itself is said to be memoryless if for all T e S, the operator 
P T FP T is memoryless on W. If F maps W e into itself and W into itself 

5 Indeed, {P T } is not a resolution of the identity if W = £oo. Most of the theorems 
which follow, however, still hold. 

6 This definition is equivalent to requiring that P T x x — P T x % implies that P T Fx x = 
P T Fx % . Other equivalent definitions are given in Reference 2. 

7 Strong causality is a low-pass filtering condition, which essentially expresses the 
fact that there is an infinitesimal delay present in the operator. 
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then Fis causal (memoryless) on Wif and only if it is causal (memory- 
less) on W e . A causal operator F from Winto itself has thus a natural 
causal extension to an operator F e from W e into itself defined for 
xeW 6 and any TeS by P T F 6 x ^ P T FP T x. The above notions of 
causality are analogously defined for operators which map W X (W^ 
into W 2 (W Ze ) where W x and PF 2 satisfy assumptions W.l through W.4 
of Section 2.3. 

Although extended spaces are not normed it is nevertheless possible 
to define the notions of continuity and boundedness for causal operators 
on extended spaces, 8 as is shown by the following definitions. 

Let F be a causal operator from W e into itself. Then F is said to be 
locally (Lipschitz) continuous on W e if P T FP T = P T F is (Lipschitz) 
continuous on W for all Te S. It is said to be locally bounded on W e if 
FO = and if, for all TeS, P T FP T is bounded on W. 

It is easily established that if F is a Lipschitz-continuous (bounded) 
operator from W e into itself, then the Lipschitz constant (bound) 
K T ofP T FP T on W is a monotone nondecreasing function of T. 

Let F be a causal operator from W e into itself. Then F is said to be 
Lipschitz continuous (bounded) on W 6 if it is locally Lipschitz continuous 
(bounded) on W e and if the Lipschitz constant (bound) of P T FP T on W, 
K T , satisfies sup TeS K T = lim^. K T < go. This supremum will be 
called the Lipschitz constant (bound) of F on W e . 

The following theorem relates Lipschitz continuity and boundedness 
on W e to the analogous notions on W. 

Theorem 2.1 

Let F be a causal operator from W e into itself, with FO = 0. If F 
is Lipschitz continuous (bounded) on W 6 then F maps W into itself 
and the Lipschitz constants (bounds) of F on W & and W are equal. 
Conversely, if F maps W into itself and is Lipschitz continuous 
(bounded) on W, then Fis Lipschitz continuous (bounded) on W e and 
the Lipschitz constants (bounds) of F on W and W e are equal. 

8 It was decided not to define boundedness or Lipschitz continuity on extended 
spaces for operators which are not causal. One could indeed define a bounded 
operator on W e as one for which there exists a K < oo such that \\P T Fx\\ < K \\P T x\\ 
for all Te S t x G W t . The author believes, however, that the very existence of ex- 
tended spaces is only justified when used in connection with causal operators. 
Indeed, implicit in the definition of extended spaces is the fact that the past and the 
future are regarded as essentially different; thus unless the operators on those 
extended spaces show a similar property, such a distinction would appear to be ill 
founded. The extended space defined in this section is sometimes referred to as the 
causal extension of W. 
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Proof of Boundedness: Let K and ^T 6 denote the bounds of F on W and 
JF e respectively. Let K e < oo and * e *F, then P* e W 6 and ||P r Fx|| < 
K \\P T x\\ < 7^ e ||x||. Thus FxeW and \\Fx\\ < A.||x||. Hence F maps 
JF into itself, is bounded on W, and K < ^ e . Let x e JF e , then 
||P r pJcII = HPyFP^H < ||PP T *|| < K \\P T x\\ which shows that K 6 < 
J£ Conversely, let K < oo and jc e W 6 , then ||P T Px|| < Jf ||P T ^||, 
which shows that F is bounded on W e and that K & < JC. Let xeW, 
then PxeHK and ||P r Px|| < K e \\P T x\\ < ^ 6 ||*||. Hence \\Fx\\ < 
^WlandJf <iS;. 

Proof of Lipschitz continuity: Since by assumption PO = 0, Lipschitz 
continuity implies boundedness with the Lipschitz constant greater 
than or equal to the bound. The proof of boundedness thus applies 
and shows that Lipschitz continuity on W e implies that F maps W'mto 
itself. The rest of the proof evolves parallel to the proof of boundedness, 
with the inequalities in the differences of x — y and Fx — Fy. The 
details are left to the reader. 

Theorem 2.1 thus shows the equivalence for causal operators of 
Lipschitz continuity (boundedness) on W e and W respectively. This 
important property of causal operators plays a central role in stability 
theory. Theorem 2.1 is easily extended to causal operators which map 
W u into W u . 



2.5 Algebras: Generalities 

As is .well known, the natural setting for the study of linear operators 
on a Banach space is as a linear Banach algebra. It is not as generally 
appreciated, however, that most of these ideas carry over to nonlinear 
operators as well. This is the subject of Sections 2.5 and 2.6. Section 
2.5 contains the relevant notions from algebras, and Section 2.6 imbeds 
nonlinear operators in this algebraic framework. 

Let A be a vector space with a mapping {multiplication) from A x A 
into A defined. Then A is said to be an {in general nondistributive) 
algebra if {xy)z = x(yz) A xyz for all x, y, z e A. An algebra is said 
to be left-distributive if (x + y)z — xz + yz and (o(.x)y — <x-(xy) A &xy, 
for all x 9 y, z e A and scalars a. It is said to be linear (or distributive) 
if it is left-distributive and if x{y + z) = xy + xz and x(oiy) = cucy, 
for all x, y, z e A and scalars a. A linear algebra is said to be commuta- 
tive if xy = yx for all x, y e A. 
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An algebra is said to have a unit if there exists an element e e A such 
that xe — ex — x for all x e A, An element x of an algebra A with a 
unit e is said to be regular (or invertible) in A if there exists an element 
x" 1 e A such that x~ x x — xx~ x = e. It is easily seen that there exists 
at most one unit and one inverse. The unit is always invertible, and 
e- 1 = e. If x and y are invertible, so is xy<> and {xy)~ x — j -1 x _1 . 

An algebra is said to be a Banach algebra if the vector space which 
defines A is a Banach space, and if \xy\ < \x\ \\y\\ for all x, y eA. 

A subset A x of an algebra A is said to be a subalgebra of the algebra 
A if A 1 is closed under the operations in A (addition, scalar multiplica- 
tion, and multiplication of elements). A subset A x of a Banach algebra 
A is said to be a subalgebra of the Banach algebra A if A x is itself a 
Banach algebra under the norm of A (thus, if A x is a subalgebra of the 
Banach algebra A, then A x is closed in the norm topology of A). A 
subalgebra A x of the algebra A is said to be regular in A if all elements 
of A ± which are invertible in A are invertible in A x . 

A subset M of an algebra A is said to be an ideal {left or right ideal) 
in A if xy, yx eM {xy e M,yx e M) for any xs M and j;e^. Note 
that if an ideal M in A contains the unit, then M = A, and that if it 
does contain the unit, then no element of M is regular. 

Let x be an element of an algebra A with unit e. The spectrum of x in 
A, denoted by o{x), is defined as 

a{x) A {a complex | ae + x is not invertible in ^t}. 

The resolvent of x in >4, denoted by p{x), is the complement in the 
complex plane of a{x). 

Definition: A Banach algebra A with a unit is said to have the con- 
traction property if — 1 £ <r(x) whenever ||x|| < 1. It follows that if a 
Banach algebra .4 with a unit has the contraction property then the 
spectrum of any element is a compact set in the complex plane. Any 
linear Banach algebra has the contraction property since 

for ||a|| < 1. 



(e + a)" 1 = IX-*) n 



The following theorem will play an essential role in the method of 
showing instability of feedback systems that will be introduced in 
Chapter 4. 

Theorem 2.2 

Let A be a left- distributive Banach algebra with a unit e, and let A+ 
be a subalgebra of A which contains the unit. Assume that A+ has the 
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contraction property. Let x>y e A + , and let C be a connected set in the 
complex plane. Assume that x + &y is invertible in A for all a e C and 
that the set M a {(x + ay)" 1 1 a e C} is a bounded set in A. Then 
(x + ay) -1 6 ^t + for all a e C if and only if (x + ^y)~ x e ^4 + for 
some a e C. 

Proof: Necessity is obvious. For sufficiency it is assumed that 
(x + aoj)" 1 g .4 + for some a e C. The conclusion clearly holds if 
y — 0. Assume thus that ||j;|| ^ 0, and let K A sup ae(7 ||(x + ay) -1 ||. 
Notice that iT > 0. It will first be shown that if (x + &iy)~ x e ^4+ for 
some complex number ax e C, then x + aj is regular in A + for 
all a g TV" A {a complex | |a — a x | < HjH" 1 ^" 1 }. Write x + ay as 
(x + ay) = (e + (a — a x )j(.x: + a^) -1 )^ + oy). The claim then 
follows from the contraction property and the obvious estimate 

|| (a - a^yix + a^)" 1 !! < |a - a x | \\y\\ \\(x + a^)" 1 !]. 

Let P denote the set in the complex plane defined by 

P a {a complex | x + ay is regular in A + }, 

and let P c denote its complement in the complex plane. The theorem 
claims that P c n C is empty. The proof goes by contradiction: assume 
that P c n C is not empty. Then 

rf(P nC,P e r\C) A inf | a' - a" | > || yir 1 JST 1 . 

aVnc 
Let 

JV X A {a complex | |a — o^l < \\yW~ 1 KTV3, a x e P c n C} ; 
iV 2 A { a complex | |a - a x | < \\yW~ 1 K' 1 ^ a 2 G P n C}. 

The sets iV^ and JV 2 are open, nonempty by assumption, and their union 
contains C Hence C is not connected. This contradiction establishes 
the theorem. 

Theorem 2.2 indicates a method for establishing that an inverse 
belongs to a subalgebra at a value of a parameter by showing invertibility 
in the subalgebra at another value of this parameter and invertibility 
in the large algebra along a continuous line in the complex plane 
joining these values of the parameter. 

Notice also that the boundedness assumption can be replaced by 
assuming that C is compact and connected, that A has the contraction 
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property, and that ||0 + x)- x || < (1 - Hall)" 1 for ||*|| < L The 
assumption implies that ||(jc + ay) - * 1 ]! is a continuous function of a 
and is satisfied, for instance, when A is a linear Banach algebra. The 
boundedness condition then follows from the fact that a continuous 
real-valued function on a compact set attains its maximum. 9 

2.6 Operator Algebras 

The following convention will be assumed throughout : 

C.1- All operators that map a vector space into a vector space, map 
the zero element into the zero element. 10 

It is convenient for purposes of analysis to classify the operators 
from W e into itself and those from W into itself. This section contains 
a listing of some important operator algebras. 

Definition: Let df(W ei W^ denote the class of operators from W e into 
itself, that is, let Jf(W„W^ A {x \ x: W e ->W e }. Let addition and 
scalar multiplication in J r (W 09 W e ) be defined in the obvious way, and 
let multiplication be defined as composition of maps. These definitions 
lead immediately to Theorem 2.3. 

Theorem 2.3 

The operators in 'A r (W e ,W e ) form a left-distributive algebra with a 
unit. 



Some important subalgebras oi Jf(yV e ,W^ are: 

1. The class ^ r+ (W e ,W e ) of causal operators from W e into itself. 

2. The class J?(W e ,W e ) of linear operators from W into itself. 

3. The locally continuous operators from W e into itself. 

4. The locally Lipschitz continuous operators from W e into itself. 

5. The locally bounded operators from W e into itself. 

6. The Lipschitz continuous operators from W e into itself. 

7. The bounded operators from W e into itself. 

8. The memory less operators from W e into itself. 

9 For more details on algebras > see Reference 4. 

10 It is possible to dispense with this assumption at the expense of a somewhat 
laborious notational framework. Any operator, however, can be fixed up so as to 
satisfy this condition by defining F'x A Fx — F0, and replacing Fx by F'x + F0. 
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Subalgebras 1-8 and any intersection of them form subalgebras 
of jV(W e ,W^ which contain the unit. 
9. The strongly causal operators from W 6 into itself form a subalgebra 
of */K(W a ,WQ which does not contain the unit. This class of 
operators is in fact a subalgebra of Jf+iW** W e ) an d an ideal in the 
class of locally Lipschitz continuous operators from W 6 into itself. 
Thus no strongly causal operator is invertible in the class of locally 
Lipschitz continuous causal operators. 
10. Let the time-interval of definition S be (— oo,+ oo). Let t be a real 
number. Then the translation operator , denoted by T T , is defined for 
xeW e by (T r x)(t) A x{t + r). An operator F from W e into itself 
is said to be time invariant on W 6 if for any t £ R 9 T T and F 
commute on W e . Let the time-interval of definition be [T ,ao) t Let 
t < be a real number. Then the translation operator, denoted by 
T r , is defined for xeW e by (T 7 x)(t) A x(t + r) for t + r > T , 
and otherwise. A causal 11 operator F from W 6 into itself is said 
to be time invariant on W e if T T and F commute on W e for all 
t < 0. The time-invariant operators form a subalgebra of 
^(W e9 W e ) which contains the unit. 

It is easy to verify that J?(W 6 ,W e ) is a regular subalgebra of 
Jf(W„W^. If S = (— oo,+ co) then the time-invariant operators form 
a regular subalgebra of ^(W e9 W e ). If S — |T 0> oo), then the time- 
invariant operators form a regular subalgebra of j¥* + {W 6 ,W^ and 
jV"{W e ,W^ t The memoryless operators also form a regular subalgebra 
ofjr+{W ei W e ) and J^(W e ,W e ). 

In the same way as for the algebra of operators from W e into itself, 
one can consider the algebra of operators from JFinto itself. Thus, let 
Jf(W,W) denote the class of operators from W into itself, i.e., 
Jf(W,W) A {x | x: W-* W). Let addition, scalar multiplication, and 
multiplication of elements be defined in the obvious way. Then there 
follows: 

Theorem 2.4 

The operators in JT{W,W) form a left-distributive algebra with a 
unit. 



11 Notice that time invariance is only defined for causal operators when S = 
[T ,oo). An alternative approach would be to define time invariance when S = 
[T ,oo) as the property of an operator with a time-invariant backwards extension 
to S = (—oo, + oo) (see Section 2.9). 
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The analogous subalgebras thus become : 

1. The class Jf+(W 9 W) of causal operators from Winto itself. 
The class Jf~(yV y W) of anticausal operators from Winto itself. 
The class Jf°{yy,W} of memoryless operators from Winto itself. 

2. The class J?(W>W) of bounded linear operators from W'mto itself. 

3. The continuous operators from Winto itself. 

4. The Lipschitz continuous operators from W into itself. 

5. The bounded operators from P^into itself. 

6. The time-invariant operators from W into itself (time invariance is 
defined completely analogously to that notion in W e ). 

Subalgebras 1-6 and any intersection of them contain the unit. 

7. The strongly causal operators from W e into itself form a subalgebra 
oiJf(W>W) which does not contain the unit. This class of operators 
is in fact a subalgebra of jV+(W>W) and an ideal in the class of 
Lipschitz continuous operators from PFinto itself. Thus, no strongly 
causal operator is invertible in the class of Lipschitz continuous 
operators from W into itself. 

£?{W,W) y J r °(y^ r ,W) 9 and the time-invariant operators with 
S = (— co, + oo) are regular subalgebras of ^(W^W). The algebra 
Jf°(yy,W) and the time-invariant operators with 5= |T ,oo) are 
regular subalgebras oiJf+(W y W) and ^(W.W). 

For the purposes of analysis it is often mandatory to work with 
Banach algebras rather than simply algebras. Operators which belong 
to Banach algebras can be "measured" and this leads to the possibility 
of establishing invertibility conditions for operators. Two important 
norms on the space of operators will be considered : the first one takes 
the Lipschitz constant of an operator as its measure and the second one 
takes the bound of an operator as its measure. 

Definition: Let 33(W,W) denote the space of Lipschitz continuous 
operators from W into itself which map zero into itself, and define the 
norm of an element F e $( W, W) by 

x.veW \\X — V 

Let addition, scalar multiplication, and multiplication of elements be 
defined in &(W,W) in the usual way. 



Theorem 2.5 
The algebra &(W, W)is a left-distributive Banach algebra with a unit. 
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Proof: The only element in the proof which is not immediate is the 
statement that .#( W> W) is a Banach space. Let {F n } , n e /+, be a Cauchy 
sequence in Jf( JF, #0. Then {F n x}, for any x e W, is a Cauchy sequence 

in W. Let Px a lim n -, fl0 F n x. Clearly F maps PF into itself and FO = 0. 
Since 

\\Fx - Fy\\ = lim \\F n x - F n y\\ < sup ||F„|| A ||x - y\\, 

II J* - F y\\ ^ 
sup - — < co, 

and Fe i(F,jr). It remains to be shown that \\F„ — F\\ A ->■ 0. 
However, since for x, y e W, x # ^, 

l |F„x - F M y - (f x - Fy)\\ 

\\x - y\\ 

_ \\F n x - F n y - lim,^,, (F m x - F m y)\\ 

II* - y\\ 
it follows that \\F n - F|| A < sup m> J|.F n - F m \\ A> which yields the 
convergence since {F n } is Cauchy in £8(W,W). 

Definition: Let @I(W,W) denote the space of operators from Winto 
itself which map zero into itself and define the norm of an element 
Fe@(W,W) by 

IIFUAsupT^. 
7 w ||x|| 

Let addition, scalar multiplication, and multiplication of elements be 
defined in £§{W,W) in the usual way. 

Theorem 2.6 
The algebra 3$( W 9 W) is a left-distributive Banach algebra with a unit. 

Proof: The proof requires only minor modifications from the proof 
of the previous theorem and is left to the reader. 

Important subalgebras of $(W>W) include: 
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2. 3?(W,W)', and 

3. the time-invariant operators in ^(W^W), 

The classes 1-3, or any intersection of them, form subalgebras of 
dt(W,W) that contain the unit. 
Important subalgebras of &(W,W) include: 

1. &+(W,W) a JT+(W,W) n £(W f W), 
&-{w,W) a jr-(w,W) n &(W 9 W) 9 
@°{W,W) a jr°(W,W) n &(W,W); 

2. £e{W,W)\zx& 

3. the time-invariant operators in £$(W 9 W). 

The classes 1-3, or any intersection of them, form subalgebras of 
38(W 9 W) which contain the unit. 

The only difficulty in showing that the above claims are correct occurs 
in the demonstration of closedness of the subspaces. This, however, 
follows readily by contradiction. The linear and the time-invariant 
operators again form regular subalgebras. 

Theorems 2.5 and 2.6 thus succeed in putting a Banach algebra 
structure on a large subclass of operators in jV{W,W). This result 
depends directly on the fact that WK itself is a Banach space, and hence 
cannot immediately be generalized to operators in ~<V(W e9 W^ 9 since 
W e is not normed. There are, however, more restricted subclasses of 
^(W e9 W e ) where such a possibility exists. 

Definitions: Let 3§ + (W e ,W^ denote the space of Lipschitz continuous 
causal operators from W e into itself which map zero into itself, and 
define the norm of an element F s &+(W e ,W e ) by 

|| P T Fx - P T Fy || 

||F|| A A sup ^ f-f- 

Tes 

Let &+(W e ,W e ) denote the space of Lipschitz continuous causal 
operators from W e into itself which map zero into itself, and define the 
norm of an element F e & + (W 6 ,W e ) by 

Ml A sup J&5! 

xZWe ||Pwd| 

P T x*Q 

Let addition, scalar multiplication, and multiplication of elements in 
&+(W ey W e ) and &+(W e9 W e ) be defined in the usual way. 
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Theorem 2.7 

The algebras ^ + (W e ,W e ) and @+(W e ,W e ) are left-distributive 
Banach algebras with a unit. 

Proof: Since by Theorem 2.1, &+(W e ,W ) and ^ + (P^,tfQ are 
isometrically isomorphic to respectively 3§ + (W,W) and 33+(W,W), 
the theorem follows from Theorems 2.5 and 2.6. 

Thus, as was shown in Theorem 2 J and pointed out in the proof of 
Theorem 2.7, &+(W ,W e ) is isometrically isomorphic to @+(W,W), 
and &+(W e ,W e ) is isometrically isomorphic to &+(W,W). As a 
consequence, the identical notation for the norms on these spaces is, 
albeit abusive, not a source of confusion. 



2.7 Contractions, Conic Operators, Positive Operators 

This section contains a number of concepts around which invertibility 
theorems will be established. Recall the assumption that for all operators 
F0 = 0. 

LetFe $(W,W).Then Fis seddtobez contraction on Wif ||Fj| A < 1. 
Let F e §§+(W e , W e ). Then F is said to be a contraction on W e if ||F|| A < 
1. Notice that by Theorem 2.1 the causal contractions on W stand in 
one-to-one relation to the contractions on W e . Let Fe &(W,W). Then 
Fis said to be attenuating on Wif \\F\\ < 1. Let Fe &+(W e ,W ). Then 
Fis said to be attenuating on W 6 if ||F|| < 1. Note that by Theorem 2.1 
the causal attenuating operators on W stand in one-to-one relation 
with the attenuating operators on W e . 

Definitions: Let r be a nonnegative real number and let c be a scalar. 
Then Fe !$(W,W) is said to be incrementally {strictly) interior conic 
on W with center c and radius r if ||F— c/|| A < r«r). Let Fe 
JV*(W,W). Then Fis said to be incrementally (strictly) exterior conic on 
W with center c and radius r if for all x, y e W, ||(F— cT)x — 
(F - c/)j|| >r\\x- y\\ (for some e > 0, >(r + e) ||x - 7II). Let 
Fe 0£(W,W). Then Fis said to be (strictly) interior conic on W with 
ce/i/er c and radius r if ||F - cl\\ < r«r). Let FeJf(W,W). Then F 
is said to be (strictly) exterior conic on W with center c and radius r if for 
all x e *F, || (F - c/)jc|| > r ||*|| (for some e > 0, >(r + e) ||*||). 

The analogous notions can also be defined on W e . Thus, let r be a 
nonnegative real number and let c be a scalar. Then F e J r+ (W e ,W e ) 
with FO = is said to be incrementally interior (strictly interior, 
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exterior, strictly exterior), conic on W with center c and rfifc&w r if for all 
x,yeW e and all Te 5, ||F r (F - c/)* - P T (F - c%|| < r \\P T (x - 
jOH (for some € > 0, <(r - e) ||P r (jc - »||, >r ||P^ - )0il, 
>(>" + e) \\Pt(x — jOID- ^ i s sa *d to be interior (strictly interior, 
exterior, strictly exterior), conic on W e with center c and radius r if for all 
jc e JF e and FeS, \\P T (F - cl)x\\ < r \\P T x\\ (for some e > 0, 
<(r-e)i|F T *||, >r\\P T x\\, >(r + e) ||P r x||). 

Conicity on JF 6 thus essentially refers to conicity on W of the operators 
P T FP T for all T e S. In many cases of interest the space W which is 
assumed to be a Banach space is actually a Hilbert space. The case in 
which W is a Hilbert space is therefore studied more intensively. The 
following concepts will be useful for this purpose. 

Definitions: Let W be a Hilbert space, and let a and b be scalars. Then 
FeJf(W,W) is said to be incrementally inside (outside) the sector 
[a,b] on W if for all x,yeW, 

Re <(F - al)x - (F - al)y, (F - bl)x - (F - bl)y) < (>0). 

It is said to be incrementally strictly inside (outside) the sector [a,b] on 
W if for some e > 0, and all x, y e W 

Re <(F - *Z)x - (F - a/)j>, (F - M)x - (F - W» 

<-e||x-j|| 2 Oellx-jU 2 ). 
It is said to be inside (outside) the sector [a,b] on W if for all xeW, 

Re <(F - a/)*. (*" - W)*> < (>0). 

It is said to be strictly inside (outside) the sector [a,b] on W if for all 
xeW, 

Re <( JT - */)*, ( F _ bI)x) < _ € || X ||2 (>€ || x p )t 

The analogous notions on W e are defined as follows: Let W be a 
Hilbert space. Let a and b be scalars. Then F eJf+(W e ,W ) is said to 
be incrementally inside (strictly inside, outside, strictly outside) the sector 
[a,b] on W e if for all x,yeW e and TeS, 

Re (F r (F - al)x - P T (F - aI)y,P T (F - 6/)jc - P r (F - bl)y) 

< (for some e > 0, < - e ||P r O - ;>)|| 2 , 

>0, >6||F r (x-7)|| 2 ). 
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It is said to be inside {strictly inside, outside, strictly outside) the sector 
[a 9 b] on W e if for all x e W e and Te S 

Re (P T (F - al)x, P T (F - bl)x) < 

(for some e > 0, < - c ||iVc|| 2 , >0, >€ ||P^|| 2 ). 

It is clear that contractive operators are particular cases of conic 
operators (with center and radius 1). Furthermore, interior conicity 
on W 6 and W are by Theorem 2.1 equivalent notions for causal 
operators. This, however, is not the case for exterior conicity. 12 

For operators on a Hilbert space, it is in general easier to verify 
sector conditions than conicity conditions. The following theorem 
establishes a relationship between conic operators and sector operators. 

Theorem 2.8 

Let W be a Hilbert space and let F e JT(W, W) (or F s JT+(W„ W,)). 
Then F is (incrementally) (strictly) interior (exterior) conic on W 
(or W e ) with center c and radius \r\ if and only if Fis (incrementally) 
(strictly) inside (outside) the sector [a,b] on W (or W e ) with a = c — r 
and b = c + r. 

Proof: Since 

Re ((F - al)x, (F - bl)x) = Re <(F - c/ + r/)x,(F - c/ - r/)x> 

= ||(F-c/)x|| 2 -Iri 2 W 2 , 
the result follows. 

It follows from Theorems 2.1 and 2.8 that the operators that are 
inside a sector on W e stand in one-to-one correspondence with the 
causal operators which are inside that sector on W. This correspondence 
again does not exist for causal operators which are outside a sector on W. 

An important role will be played in the sequel by positive operators. 
They are generalizations of nonnegative definite linear operators on 
Hilbert spaces and have been referred to as dissipative, passive, or 
monotone operators. 13 

Definitions: If W is a Hilbert space and FeJ r (W,W), then F is said 
to be incrementally positive on W if Re (x — y, Fx — Fy) > for all 

12 It suffices therefore to consider a time-delay on L 2 {~~ co, + oo). This operator is 
exterior conic with center and radius 1 on L 2 (— oo , -f- co), but not on L u (~- oo , + oo). 

13 An important reference for the implications of positivity to well-posedness 
problems for partial differential equations is Reference 7. 
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x,y eW. It is said to be positive on Wif Re (x 9 Fx) > for all x e W. 
It is said to be (incrementally) strictly positive on W if for some e > 0, 
the difference F — el is (incrementally) positive on W. F is said to be 
(incrementally) (strictly) negative on W if the operator —F is (incre- 
mentally) (strictly) negative on W* Let Fe ^^(W^W^. Then Fis said 
to be positive on W e if Re (P T x,P T Fx) > for all x e W 6 and Te 5. 
The notions of negativity, incremental positivity (negativity), and strict 
(incremental) positivity (negativity) on W e are defined in obvious 
analogy with those notions on W. 

The relationship between positive operators on W and W e is treated 
in the following theorem. 

Theorem 2.9 

Let FeJf+(W e9 W e ) and Jr+(WJV). Then F is (incrementally) 
(strictly) positive (negative) on W 6 if and only if F is (incrementally) 
(strictly) positive (negative) on W. 

Proof: Since P T is a projection onto a closed subspace of W for all 
T e S, and F is causal, it follows that for all x e W, 

{PrpXiPrpFx) = \PfpX,PrpFP^iX) = \P rpX ,FP rpX) \ 

this shows that positivity on W indeed implies positivity on W 6 . 
Conversely, assume that F is positive on W e but that for some x e W, 
(x,Fx)<0. Since lim T ^ o0 P T x = x, this implies that {P T x,Fx) — 
(P T x,P T Fx) < for some Te S. This contradiction ends the proof of 
the theorem. 

Remark: It is sometimes easier to perform certain calculations with 
S = (— oo, + oo) and then to draw conclusions for S = |T ,oo). The 
following considerations may be helpful in this regard. Let S = 
(— oo, + oo), let Wbe given (over S), and let F be an operator from W 
into itself. Let S' ~ IT ,oo) and let W be the space having S' as the 
interval of definition and consisting of those ^-valued functions on 
S f that when extended by zero on S — S f yield elements on W. Let 
HjcIIjp' be equal to the norm of the element of W that when projected 
on the subspace of functions with support on 5" yields x(t) as the value 
of x for t e S'. Let x' s W and x e W be defined by 



x(t) 



V(0 

.0 



for t e S' 
otherwise 



Let F*\W'-+ W be defined as (FV)(0 = (Fx)(t) for / e S'. It is clear 
that F' is well defined. 
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The space W and the operator F' will be called a restriction of W 
and F to S' = |T ,ao). The same procedure allows one to obtain a 
restriction on ^ of an operator on W e . 

The restriction of an operator preserves some of the essential 
properties of the original operator. These include, for instance, 
linearity, causality, and time invariance. It is easily verified that the 
restriction of an operator is (incrementally) positive if the original 
operator i$ itself (incrementally) positive. The same holds for interior 
(incremental) conic, contraction, and attenuating operators. Exterior 
conicity, however, is in general not preserved under this restriction 
unless the operator also happens to be causal. 



2.8 Conditions for Invertibility of Nonlinear Operators 

2.8.1 General Invertibility Conditions on W Involving Contractions, 
Conicity, Sector Conditions, and Fositivity 

This section contains general conditions for the invertibility of 
nonlinear operators on W. It should be remarked that at no point will 
the special structure of ^introduced in Section 2.3 be exploited. The 
invertibility conditions involve contractions, conicity, sector conditions, 
and positivity. All of these conditions are immediately related to the 
celebrated contraction mapping principle 14 and the conditions involving 
contractions and positivity have previously appeared in the mathe- 
matical literature. The conditions involving conicity and sector con- 
ditions are new, 15 and should, as sufficient conditions for invertibility 
of nonlinear operators, be of considerable interest in applied mathe- 
matics. 

The conditions of some of the theorems are on occasion rather 
involved, and it is therefore suggested that the reader concen- 
trate first on Theorem 2.11, Theorem 2.12, Corollary 2.14.1, and 
Theorem 2.15. 

It is worthwhile to mention that all the invertibility theorems which 
follow are in essence based on the contraction mapping principle. They 
thus yield as an important side aspect a convergent recursive algorithm 
for evaluating the inverse. In these invertibility theorems special 

14 See Reference 8, p. 34. For some folklore about fixed-point theorems, see 
Reference 9. 

15 Related, but much more restrictive, conditions have appeared in the mathemati- 
cal literature (Ref. 8, p. 296). The author draws his inspiration from the work of 
Zames (Ref. 10), who proves stability theorems using similar conditions. After the 
link between stability and invertibility is established (as will be done in Chapter 4) 
these invertibility theorems become apparent. Their derivation is of independent 
interest, however. 



CONDITIONS FOR INVERTIBILITY OF NONLINEAR OPERATORS 29 

emphasis is placed on whether or not a particular inverse belongs to 
the subalgebras to which the original operator belongs. The interest in 
this aspect stems mainly from the applications to instability conditions 
as will become apparent in Chapter 4, 

Consider the Banach algebra &(W,W) introduced in Section 2.5. 
Recall that 3§ + {W,W) is isometrically isomorphic to £+(W e ,W e ) and 
that invertibility conditions on &+(W B ,W e ) thus lead to identical 
invertibility conditions on & + (W eJ W e )* 

Theorem 2.10 

The algebra £%(W 9 W) and any of its subalgebras which contain the 
unit have the contraction property. 

Proof: Let F e 33(W 9 W) with \\F\\ A < 1. Then by the contraction 
mapping principle, I + Fis invertible on W and its inverse is Lipschitz 
continuous. In fact, 

and the solution to the equation x = — Fx + u with u e W given and 
x e W unknown can be solved by successive approximations with 
x n+1 = — Fx n + u, n e/+ The resulting sequence {x n } 9 with n el+, 
is a Cauchy sequence in W and converges for any choice of x e W to 
the unique solution (7 + F)~^u t This algorithm for computing the 
solution shows that any subalgebra of $(W 9 W) also has the contraction 
property. 

Theorem 2.10 immediately leads to the invertibility condition of 
Theorem 2.11. 

Theorem 2.11 

Let FeJ r {W 9 W) be a contraction on W. Then ~\$a{F) in 
&(W,W). Moreover, if F belongs to a subalgebra of ^{W 9 W) that 
contains the unit, then so does (/ + FY 1 , Finally, 



Theorem 2.12 

Let F l9 F 2 eJT(W 9 W). Then -1 ^a{F x F^ in J^{W 9 W) if there 
exists a scalar c such that —1 <£<r(cF 2 ) in Jf{W 9 W) and {F x — cT) 
F % {I + cFz)- 1 is a contraction on W. Moreover, (I + F^)' 1 e $(W t W) 
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if (J + cF 2 )' 1 e $i(W,W)> and if F x and F % belong to a subalgebra of 
£(W,W) that contains the unit, then (/ + i^)" 1 belongs to that 
subalgebra if and only if (J + cF^T 1 does. Finally, 



11(1 + W 1 !!* < 



||(i + cF^U 



1 - \\{F X - d)F 2 (7 + cFj 



fc — in 



P/w/; Since / + F^ - (/ + (F x - c/)F a (/ + cF^^I + cFj, the 
operator I + F X F^ is the product of two elements of JV{W,W) both of 
which are invertible in Jf{W,W) y and is thus itself invertible in 
Jf{W,W). To prove the second part of the theorem, express I + F X F 2 
as (/ + cF^ + CFi - c/)F 2 . Clearly, (/ + cF 2 ) + cl(F x - cI)F z is 
invertible in $(W,W) for all < a < 1 and the inverse for a = is 
(/ + cF 2 )~\ It thus follows from Theorem 2.2 that the inverse for 
a = 1, (/+ F X F^~ X , belongs to exactly those subalgebras to which 
(I + cFz)" 1 belongs. 

Corollary 2.12.1 

Let F l9 F 2 e JS(w,W). Then -1 $ a(F x F 2 ) in JT(W,W) if for some 
scalar c and r > 0, F x is incrementally strictly interior conic on W with 
center c and radius r, -1 $ <r(cF 2 ) in JT(W 9 W) 9 and F 2 satisfies for all 
x, yeW the inequality ||(/ + cF 2 )x - (/ + cF 2 )j/|| > r \\F z x - F a ;>||. 
Moreover, (/ + F x F 2 )~ l e S(W,W) if (J + cF^" 1 e @(W,W), and if 
Fi and F 2 belong to a subalgebra of &(W 9 W) that contains the unit 
then (7 + F X F^" X belongs to that subalgebra if and only if (I + cF^ 1 
does. Finally, 

ll(/ + fm L < 1 _ ||(Fi _ cI)yr < x _ l]Fi _ cjyr 



Proof: The conditions of the Corollary imply that \\F 2 (I + cF^)" 1 ^ < 
r- 1 which yields \\(F X - c/)F,(/ + cF^'Ia < 1- The Corollary then 
follows from Theorem 2.12. 

Note that if 0£<r(F 2 ) in Jf{W,W) then Corollary 2.12.1 requires 
that F^ 1 be incrementally exterior conic on W with center — c and 
radius r" 1 . 

Very often the space W of interest turns out to be a Hilbert space. 
In that case the formulation of Corollary 2.12.1 becomes considerably 
more elegant, particularly if expressed in terms of sector conditions. 
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Theorem 2.13 

Let F Xy F 2 e jV{W,W) with W a Hilbert space. Then -1 £ a(F x F 2 ) 
in jV*{W>W) if for some scalar c and r > 0, the operator F x is incre- 
mentally strictly interior conic on W with center c and radius r, 
— 1 £ g{cF^ in Jf{W>W)i and F 2 satisfies one of the following condi- 
tions : 

Case 1 : \c\ < r, and F 2 is incrementally interior conic on W with 

c . , . ,. r 



center 



r 2 - |c| s 



and radius 



r 2 -lcl 2 



Case 2: \c\ > r, and F 2 is incrementally exterior conic on W with 

c , r 



center 



|cl 2 -r 2 



and radius 



Case 3: |c| — r, and 7 + 2cF 2 is incrementally positive on W. 

Moreover, (/ + F X F 2 )~ X e &(W,W) if (/ + cFJ" 1 6 M{W y W), and 
if F x and F a belong to a subalgebra of M{W y W) that contains the 
unit, then (I + F X F 2 )~ X belongs to that subalgebra if and only 
if (I + cFa)" 1 does. Finally, 

\\(J -l. F F v-ill < IK* + cF 3 )- 1 11 a < 1 + Mir 
U(i + Xi**) Ha ^ : — rry ^ : — ^r~r 

1 - || F x - d\\Jr 1 - ||F X - d\\Jr 
Proof: If |c| # r then for all x,y e W the following identities hold: 

('-^^-('•-^r'» 



r 2 - c 2 



1 



r 2 — |c| a 

% 1 



II* - yf 



((r 2 - Icl 2 ) ||F,x - F a y|| 2 

- 2 Re c(x - y, F s x - F 2 y) - \\x - y\\*), 
-T^—^WFzX-Fzyf 

- ||(J + cF 2 )x - (/ + cF 2 )yf). 
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It thus follows from Theorem 2.8 that both cases 1 and 2 imply that 
||(7 + cF 2 )x - (7 + cF 2 )y\\ > r \\F 2 x - F^L which by Corollary 2.12.1 
yields the conclusions of the theorem for these cases. If \c\ ~ r then 
Re (x - y, (7 + 2cF 2 )x - (7 + 2cF 2 )y) > for all x, y e W. Hence 
||(7 + ci ? 2)x _ (/ + cF ^yY - |<f 11^ _ f ^y < o and ||(7 + cF 2 )x 
- (7 + cF^H 2 > r 2 ||iVc - T^H 2 which by Corollary 2.12.1 yields 
the conclusions of the theorem for case 3. 



When Theorem 2.13 is expressed in terms of sector conditions, it 
becomes 



Theorem 2.14 

Let F u F 2 e Jf(W,W) with W a Hilbert space. Then -1 $ a(F l F 2 ) 
in JfiyV.W) if for some scalars a, b, the operator F 1 is incrementally 
strictly inside the sector [a,b] on.W,—l$ a(i(a + b)F 2 ) in Jf{W,W) 
and F 2 satisfies one of the following conditions: 

Case 1: \a + b\ <\a — b\, and F 2 is incrementally inside the sector 
a b 



[ 



Reab 



Re aft 



ab\ 



on W. 



Case 2: \a + b\> \a — b\, and F 2 is incrementally outside the sector 

r__i i-i 

L Re ab ' Re ab] 

on W. 
Case 3: \a + b\ - \a — b\, that is, Re ab = 0, and 7 + (a + Z>)F 2 is 
incrementally positive on W. 
^Moreover, (7 + F^)" 1 e i?(PF, PF) if (7 + «a + ^a)" 1 e 
^{W t W), and if 2^ and F 2 belong to a subalgebra of ${W,W) 
that contains the unit, then (7 + F 1 F 2 y~ 1 belongs to that sub- 
algebra if and only if (7 + cF 2 )~~ x does. 

Proof of case I: The conditions on F z imply by Theorem 2.8 that F 2 is 
incrementally inside the cone with 



center 



a + b 
2R& ab 



and radius 



a-b 



2Reai 
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and is thus inside the cone with 
-(* + £>)/2 



center 



|(a + ft)/2| 2 -|(a-fe)/2| 2 
and radius 



1(3 - i)/2| 



| (a _ 6 )/2|*-|(a + W 
this yields the required result by Theorem 2.13. 



Proof of case 2: This case is proved in a manner similar to case 1 and 
the details are left to the reader. 

Proof of case 3: This is a direct consequence of Theorem 2,13. 

The case when a and b are real is of particular interest and leads to 

Corollary 2.14,1 

Let F l9 F % e Jf(W,W) with W a Hilbert space. Then -1 £ a (FJ?^ 
in ^{WyW) if for some real numbers a < b, b > 0, F x is incrementally 
strictly inside the sector [a,b], — 1 £ o{\{a + b)F 2 ) in JT{W,W)^ and 
7*2 satisfies one of the following conditions : 

Case 1 : a < 0, and F 2 is incrementally inside the sector [—1/6, — 1/#] 

on W. 
Case 2: a > 0, and .F 2 is incrementally outside the sector [— 1/cr, — 1/&] 

on *F. 
Case 3: a = 0, and jp a + l\b is positive on W. 

Moreover, (7 + F 2 F X )~ X e £(W,W) if (7 + i(* + ^J" 1 e 
&(W,W); and if i^ and F 2 belong to a subalgebra of S{W,W) that 
contains the unit, then (7 + F^)" 1 belongs to that subalgebra if and 
only if (7 + cF^Y 1 does. 

The following invertibility theorem is stated in terms of positivity 
conditions and will play an nnportant role in stability theory. It is an 
immediate consequence of the following important result that has 
recently appeared in the mathematical literature. 

Lemma 2.1 

Let FsJ r {W,W) be continuous on Wwith W a Hilbert space. If F is 
incrementally strictly positive on W then F is invertibie on ^{W.W) 
and F- 1 e ^{W,W) and is positive on W. Moreover, if Fe &(W,W). 
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then F" 1 is incrementally strictly positive on W, and belongs to all 
subalgebras of $l(W,W) which contain the unit to which F belongs. 
Proof: This lemma is a consequence of the monotone mapping theorem 
(see Reference 11). If Fe 3${W,W) then the inverse can be obtained as 
follows: Let e > be such that F — e/is incrementally positive on W, 
and let a = €/||il|. Then the operator G A / — <xF is a contraction 
on W and F~ x s= (/ — G)~ 1 od. This expression then yields the sub- 
algebraic properties claimed for F~K 

Theorem 2.15 

Let F u F z e Jf(W,W) be continuous on W> with TKa Hilbert space. 
Then —1 £ aCF^) * n ^{W,W) if i^ is incrementally positive on TF, 
i^ 2 is incrementally strictly positive on W, and F % is Lipschitz continuous 
or Fj is also incrementally strictly positive. Moreover, (/ + F 1 F^)r 1 e 
g$(W, W) and belongs to all subalgebras of £g(W 9 W) which contain the 
unit to which F x and F 2 belong. 

Proof: By Lemma 2.1 5 F z is invertible on W, and thus / + F^ = 
(j?-i + FJjpg. Since Fp + F x is incrementally strictly positive on W 9 
it is invertible on W. Thus / + F X F^ is the product of two invertible 
operators with inverses in $I(W,W). The statement about the sub- 
algebras follows from the expression of the inverse given by Lemma 2.1. 

Remark L Notice that the proof of Theorem 2.15 also yields as a side 
result that F 2 (I + F^y 1 is itself incrementally strictly positive and 
Lipschitz continuous. 

Remark 2. If Fis a causal, invertible (incrementally) positive operator 
on W, then a simple calculation shows that F" 1 is also (incrementally) 
positive. Some simple additional assumptions (see Refs. 12, 13) 
will then assure that F -1 is also a causal, invertible (incrementally) 
positive operator on W. These conditions lead to somewhat more 
general conditions of establishing the causality of inverses than the 
methods implied by the subalgebra considerations of Theorem 2.15. 

2.8.2 A General Invertibility Condition on W e Involving Strongly Causal 
Operators 

Section 2.8.3 is concerned with invertibility conditions for causal 
operators on W when S = [T , oo). Since most of the theorems in that 
section assume a priori invertibility on W e , it appears natural to 
present first a general condition for invertibility on W e . 
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Theorem 2.16 

Let S — [T ,co) and F be a strongly causal operator on W 6 . Then 
a(F) on Jf+iW^W^ consists of the zero element only. 

The proof of this theorem will be given in Chapter 4 when discussing 
well-posedness of feedback systems. 

Theorem 2.16 shows that invertibility on W 6 of the identity plus a 
causal operator is a very weak assumption when the time-interval of 
definition S is |T ,oo). This is a well-known fact for Volterra integral 
equations and differential equations. 



2.8.3 Invertibility Conditions on W for Causal Operators 

The question posed in this section is the following: Since, as pointed 
out in Section 2.8.2, invertibility of / + F on the extended space 
essentially comes "for free" when Fis b. causal operator, can this fact 
somehow be exploited to obtain weaker conditions for invertibility on 
the nonextended space? Many operators encountered in practice are 
causal, and it turns out that the causality of operators can indeed be 
quite successfully used in obtaining such invertibility conditions. The 
theorems which follow have implications in stability theory, as will be 
pointed out in Chapter 4. 

The invertibility conditions which follow are nonstandard in the 
mathematical literature and should be of considerable interest. The 
theorems are essentially restricted to causal operators, however. This 
does not detract from their value, since, in many fields — particularly 
mathematical system theory — the operators considered are more often 
than not causal. 

The invertibility theorems which follow are completely analogous to 
those obtained in Section 2.8.1. However, they dispense with incre- 
mental conicity or incremental positivity in favor of conicity and 
positivity. The conditions thus become a great deal less restrictive. 16 

It is extremely important, when proving the theorems that follow, to 
keep in mind the isometrically isomorphic equivalence of &+(W,W) 
and & + (W e ,W e ) as exposed by Theorem 2.1. 

16 However, the resulting theorems do not allow one to conclude the continuity of 
the inverse. It is also much less obvious how to construct a recursive algorithm for 
the computation of the inverse. This can be done, however. For instance, in Theorem 
2.17 one can use the invertibility on W e (which is algorithmic, as will be pointed out 
in Chapter 4) with the invertibility on W to obtain an algorithm, a bound on the 
error, and a rate of convergence for the solutions. The calculations in Reference 14 
may be helpful in this respect. 
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Theorem 2.17 

Let Fe @+(W e ,W e ), and -1 £ a(F) in jV+(W e ,W^. If F is atten- 
uating on JF (or W e ), then — 1<£<7(F) in &+(W,W). Moreover, 

\\a + F yi\\ < (i - \\F\\y\ 

Proof: Let ue W, and consider the equation x + Fx = w. This 
equation has a unique solution x eW e by assumption. Thus, P T x + 
P T Fx = F T w for all TeS, which yields 

11*2*11 - 11*11 11*2*1! < 11*2*11 ~ 11**2*11 

< 11*2*11 - ll*T**2*ll < 11*2*11 < N- 

Hence, ||*2*||< ||tt||/(l - ||*||) for all TeS, and jc 6 IF -with 
||je|| < ||m||/(1 - \\F\\). Thus (7 + F)~ l exists on W. It remains to be 
shown that (7 + F)" 1 is causal on W* This, however, follows since 
(J + FY X on W is the restriction of (7 + F)" 1 on W 6 to W. Since the 
latter inverse is causal by assumption, the theorem follows. 



Theorem 2.18 

Let F l9 F 2 eJ r +(W ei W e ) i and -1 $ a(F x F 2 ) in jT+(W„WJ. Then 
(7 + *i*a) -1 G ^ + {W,W) if there exists a scalar c such that — 1 $ 
a(cF 2 ) in J^+(W e ,W e ), (7 + cF 2 )~ x e 0+(W,W) and (F 1 -cl) 
F 2 (7 + cFJr 1 is attenuating on W. In fact, 

iKi + cf 2 t x \\ 



||(i + ^F.)" 1 !! < 



1 - ||(F X - c7)F 2 (7 + cF 2 )- 



Proo/; Since / + F X F 2 = [I + (F x - c7)F 2 (7 + cF^K/ + cFJ, the 
operator /+ F X F Z is the product of two elements of Jf+{W„W^* 
Both / + cF 2 and 7 + F X F 2 are invertible in Jf+(W„W^ by assumption 
which thus yields that 7 + (F x — c7)F 2 (7 + cF^)~ x is invertible in 
^V+(W e ,W e ). By Theorem 2.17, [7 + (F x - c7)F 2 (7 + cF,)- 1 ]- 1 actually 
belongs to ££ + {W,W)« The estimates in the theorem are obvious. 

Corollary 2.18.1 

Let F l9 F z eJr+(W ei W e ), and -1 £ a{F x F 2 ) in Jf+{W e9 W). Then 
(7 + F X F 2 )~ X e @+(W,W) if for some scalar c and r > 0, the operator 
F x is strictly interior conic on W with center c and radius r, — 1 $ 
<y(cF 2 ) in J^+(W e ,W e ), and rF 2 (7 + cFa)" 1 is attenuating on PT. In fact, 

11(7 + cF,)- 1 !! ^ l+|c|/r 



||(J + F^T'W < 



1 - \\F X - cJ||/r 1 - \\F X - c/||/r 
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Proof: Since (7 + cF a )(7 + cF 2 )~ l = Jon FF 6 , the inverse (7 + cFa)" 1 = 
/ _ cf 2 (j + C ir a )-i on PF„ which shows that (7 + cF^' 1 e &+(W,W) 
andrthat ||(/ + cF^W < 1 + \c\\r. Corollary 2.18.1 then follows from 
Theorem 2.18. 



Corollary 2.18.2 

Let Fx, F % ejV+{W„WX and -1 $a(F x F 2 ) in JT+(W„W \ Then 
(7 -|- FiFa) -1 e &+(W,W) if for some scalar c and r > 0, the operator 
Fi is strictly interior conic on W with center c and radius r, — 1 £ tf(cF 2 ) 
in Jf+{W„W^ 9 and F a satisfies for all x e JF and Je S the inequality 
r ||* r F,x|| < 11*2.(7+ cF 2 )x\\. In fact, 

1 + \c\/r 



||(7 + FxF,)" 1 !! < 



1 - ||F, - cl\\/r 



Proof: Since the conditions of the corollary imply that rF 2 (7 + cF^r x 
is attenuating on W, this theorem becomes a simple consequence of 
Corollary 2.18.1. 

Note that if 0£<r(F 2 ) in Jf + (W e ,W e ) then Corollaries 2.18.1 and 
2.18.2 require F% x to be exterior conic on W e with center — c and radius 
r" 1 . Note also that Corollary 2.18.2 does not require computation of 
(7+ cF)- 1 or even proving that (7 + cF)" 1 e @(W„W ). If this has 
been established independently, then the condition on F 2 becomes 
somewhat simpler and requires that F 2 satisy, for all x e W, the in- 
equality r \\F z x\\ < ||(7 + cF,)x||. 

Very often the space W under consideration is actually a Hilbert 
space; the formulation of the above corollaries becomes considerably 
more elegant if this fact is exploited, and the invertibility conditions of 
the previous theorems are expressed in terms of sector conditions. 

Theorem 2.19 

Let F l9 F 2 g jV+OVvWJ, with W a Hilbert space, and -1 $ a(F x F 2 ) 
in J^+iW^WX Then (7 + F^)" 1 e Oh-(JV 9 W) if for some scalar c 
and r > 0, the operator F x is strictly interior conic on W with center 
c and radius r, -1 $ <r(cF 2 ) in JT+{W„W^ and F 2 satisfies one of the 
following conditions: 

Case 1 : \c\ < r, and F 2 is interior conic on W with 



center 



M s 



and radius 



r - c| 
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Case 2: \c\ > r, and F 2 is exterior conic on TF 6 with 
c ... r 



center 



r 2 - |c| 2 



and radius 



| c |»_r»* 
Case 3 : |c| = r, and / +• 2cF 2 is positive on W e . 

1 + |c|/r 



In fact, |I(I + F 1 FJ- l \\< 



1 _ iFl _ c /|| /r ' 



P/w/; Combining the ideas of the proof of Theorem 2.13 and the 
result of Theorem 2.18 and its corollaries leads to this. result without 
difficulty. The details are left to the reader. 

Theorem 2.20 

Let F u F t e jV+(W ei W e ) 9 with Wz Hilbert space, and -1 £ a(F x F % ) 
in JT+(W 69 W^ Then (/+ F^)" 1 e &+(W,W) if for some scalars 
a 9 b, the operator F x is strictly inside the sector [a 9 b] on W 9 — 1 £ 
o-(l(a + *)*a) * n ^ + (W e ,W^ 9 and F 2 satisfies one of the following 
conditions: 

Case 1 : \a + b\ < \a — -b\ 9 and F 2 is inside the sector 

• r *—, L.] 

L Re afe ? Re a£J 

onW. 
Case 2: |a + b\ > |a — 6|, and F % is outside the sector 



L Re a£ ' Re ab\ 



on TF e . 
Case 3: \a + b\~\a — b\, i.e., Re a£ « 0, and / + (a + 6)F 2 is 
positive on W . 

The proof of Theorem 2.20 is entirely analogous to the proof of 
Theorem 2.14 and is left to the reader. 
The case when a and b are real is of particular interest and leads to 

Corollary 2.20.1 

Let F l9 F 2 e J^ + {W e9 W^ 9 with W a Hilbert space, and -1 £ a{F x F^ 
in JT+{W 69 W^. Then (/ + F X F^ e @+(W 9 W) if for some real 
numbers a < b 9 b > 0, F 1 is strictly inside the sector [a 9 b] on W 9 



CONDITIONS FOR INVERTIBILITY OF NONLINEAR OPERATORS 39 

— 1 £ <r(i^z + £)F 2 ) in Jf*(W e9 W e ), and F 2 satisfies one of the following 
conditions: 

Case 1 : a < 0, and F 2 is inside the sector [—1/6, — 1/a] on PF. 
Case 2: a > 0, and F 2 is outside the sector [— \\a 9 —l[b] on W . 
Case 3 : a = 0, and jF 2 + 7/6 is positive on W e . 

The invertibility theorems which follow involve positive operators. 

Theorem 2.21 

Let F l9 F 2 e JT+(W„W^ 9 with W a Hilbert space, and -1 £ a{F x F 2 ) 
in JT+(W„ W 6 ). Then (I + F^y 1 e @+(W 9 W) if F t is positive on W 6 
and F 2 is strictly positive and bounded on W. 

Proof: Let ueW 9 and consider the equation x + F x F^x = w. This 
equation has a unique solution x e W e by assumption. Thus for all 
T e iS, P^x + Pt^iF % x — P y w. Hence by strict positivity and the 
Cauchy inequality, there exists an e > such that for all Te S 9 

e \\P T x\\ 2 < KPrF^PyX + P T F x F % x)\ = KPyF 2 x,Pyw)| 
< ||P y F 2 *|| HP^II < ||F 2 || ||P^|| \\P T u\\. 

Consequently \\P T x\\ < e" 1 ||F 2 || ||u|| for all TeS which shows that 
x e Wand \\x\\ < e" 1 ||F 2 || ||w||. It remains to be shown that (/ + F X F Z )- X 
is causal on W. This, however, follows since (I + F^F^f 1 on Wis the 
restriction of (7 + F^' 1 on W 6 to W. 



Remark: If in addition £ tf(F 2 ) in ^ + (W e ,W e ) (which will happen 
under very weak conditions since F 2 is causal and strictly positive), 
then F 2 (7 + F^)" 1 turns out to be itself strictly positive and bounded 
on W. This fact has a rather interesting interpretation in terms of 
passive systems. 17 Indeed, consider F 2 and F x as expressing, respectively, 
the driving-point admittance and the driving-point impedance of a 
passive network. Then F%{J + F^)* 1 is the driving-point admittance 
of the parallel combination shown in Figure 2.1, which is itself clearly 
passive if F x and F 2 are passive. The reader will be interested in verifying 
that Theorem 2.21 can be refined by requiring positivity and bounded- 
ness of F 2 (rather than of F x ) and merely positivity of F l9 or by 
requiring strict positivity on F x and F 2 but no boundedness. 

17 This interpretation is due to Zames (Ref. 10). 
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Figure 2.1 A Realization of F 2 (l + F x F 2 y x 



2.9 Conditions for Noninvertibility of Nonlinear Operators 

Section 2.8 has been concerned with establishing conditions under 
which nonlinear operators are invertible. The present section addresses 
itself to the opposite question: namely, under what conditions can it be 
guaranteed that a particular operator is not invertible? The interest in 
this question stems primarily from the fact that noninvertible operators 
lead to unstable feedback systems, a fact which will be explored* in 
detail in Chapter 4. 

A great deal of the mathematical literature concerning nonlinear 
operators is devoted to the question of invertibility of operators. Some 
such results were presented in the previous section. Quite surprisingly, 
virtually no research has been devoted to exploring sufficient conditions 
for the noninvertibility of operators. The present section constitutes 
such an effort and contains quite specific conditions for operators to be 
noninvertible. 18 

First of all, note that the invertibility theorems obtained in Section 
2.8 give conditions for certain operators not to be invertible in a given 
subalgebra. In particular, Theorem 2.12 thus implies 

18 The estimates which go into the proofs of the noninvertibility theorems are 
entirely analogous to those used in the invertibility theorems of Section 2.8. The 
inspiration for these noninvertibility theorems stems from the relationship between 
noninvertibility and instability as explained in Chapter 4, The instability theorems 
that led the author to suspect the specific conditions are those obtained using 
Lyapunov theory in Reference 15. Notice also the crucial role played by Theorem 
2.2 in the proof of the invertibility theorems which follow. 
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Theorem 2.21 

Let F x , F 2 e &h-(W 9 W). Then -\ea(F x FJ) in jV+(W,W) if there 
exists a scalar c such that -l£(cF 2 ) in &(W 9 W), (I + cFJr 1 $ 
jV+(W t W) 9 and if {F x - cI)F 2 (I + cF^ 1 is a contraction on W. 

Note that Theorem 2.21 and its corollaries, which can be obtained 
by making the contraction condition more specific (as in Corollary 
2.12.1, Theorem 2.13, Theorem 2.14, and Corollary 2.14.1) do state 
sufficient conditions for a particular causal operator / + F X F Z not to be 
invertible in the algebra of causal operators. This is done by showing 
that / + F X F % is invertible but by ensuring, through Theorem 2.2, that 
this the inverse is not causal on W. Theorem 2.21 is predicated on the 
possibility of showing that / + cF % is invertible on Wfor some scalar c, 
but that this inverse is not causal. 

Remark 1; The preceding statement trades the problem of showing 
that a particular inverse (/ + F X F^ is not causal for a similar one, 
namely, showing that (/ + cF^jr x is not causal. In general, however, 
the operator F = F X F% is given and F z can be selected to convenience 
(modulo the condition F — F X F 2 ). Very often F 2 can thus be chosen to 
be relatively simple as compared to F, e.g., linear and time invariant, 
so that much more can be said a priori about the inverse (J + cF 2 )~ x 
than about the inverse (/ + iV^)" 1 . 

Remark 2: When S — [r ,oo), very weak conditions (e.g., strong 
causality of F X F^) will ensure that / + F X F 2 is invertible in JT^{W^W^ 9 
and whenever (J + F X F^~ X exists in J^(W,W), it will thus necessarily 
be causal (as the restriction of this inverse on W e to W), Theorem 2.21 
will then, of course, never succeed in showing that / + F X F 2 is not 
invertible in Jf+(W,W). A method, believed to be innovative, of 
treating this case is outlined in the remainder of this section. 

Definition: Let S = [T^oo), W (defined over 5), and FeJ^+(W e9 W e ) 
be given. Let W consist of 5-valued functions defined on S f = 
(_ oo,+ oo) and F' e ^(W'^W^. Then S\ W\-an& F' are said to be a 
backwards extension 19 of 5, W, and F if they satisfy the following 
hypotheses: 

1. The space W" satisfies the assumptions W.1-W.4 enumerated in 
Section 2.3. 

19 Compare this with the definition of the restriction of an operator to a smaller 
time-interval of definition, as explained in Section 2.7. In fact, F is a restriction of 
F' if and only if F' is a backwards extension of F. 
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2. If x e W, then the ^-valued function 

x(0 for teS 

,0 for t e (S' - S) 



x'(0 = 



belongs to W. 

3. If x' e W\ then the 5-valued function x with *(f) = x'{t) for f e 5 
belongs to WK 

4. The operator F' is an element of ^HW^W'J. 

5. Given any x' e W with x'(0 = for t < T and jc e JT with 
x(f) = *'(*) for t e S, then FV(r) = F#(f) for / e S. 

The following lemma plays a crucial role in the results that follow 
and is believed to be of some interest in its own right. It essentially 
states that if a Lipschitz-continuous operator is invertible on one 
half-line, then it is invertible on any half-line. 

Lemma 2.2 

Let t l9 t % e S be given, Fe &+(W e ,W 6 ), and let F be invertible in 
^HW e9 W e ). Then F has an inverse on W H A {x eW\P h x = 0} */ 
and only ifF has an inverse on W i% A {x e W \ P t2 x = 0}. 

Proof: Note that F has a causal inverse on W t±e and W t%6 since it has 
a causal inverse on W e by assumption. Let t x < r 2 . If F has an inverse 
on W ti , then the restriction of this inverse to the subspace W t of W t 
clearly qualifies as the inverse on W tz since by causality this inverse 
maps W h into itself. Conversely, assume that F is invertible on W h 
and let u e W H be given. Let e — F~ x u e W He be decomposed as 
e = P^e + {I — P t ^e. Thus P h e e W. Since F is Lipschitz contin- 
uous, it follows that \\P T (Fe - F(/- P h )e)\\ < i|F|| A ||i\eii for all 
TeS,T> t %9 which shows that v A Fe — F(7 - i\ 2 )e e *F. Hence 
F(/ — P t2 )e — Fe — v = u — v eW. Since w — v s W t2 and Fis invertible 
on W t%i this thus yields (/ - P h )e - F^O - t?) e JF V Thus e = F (2 e + 
(J — P t2 )e e PP, which ends the proof of the lemma. 

Theorem 2.22 

Let5 = [r ,oo),Fe &+(W e ,W e ), and Fbe invertible in J^^(W ei W e ) t 
Let W and F'e $+{W' e ,W' e ) be a backwards extension of W and F 
from S = [r ,oo) to 5' = (— oo,+ oo), and assume that for all TeS' 
the operator F' has a causal inverse on W' Te a {x g HKJ | F^ = 0}. 
Then F" 1 £ Jf{W 9 W) if F' has a noncausal inverse on TF^ . 
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Proof: Since F' has a noncausal inverse on W', there exists at least one 
TeS' such that F' is not invertible on W' T A{xeW'\ P T x = 0}. This 
then implies by Lemma 2.2 that F' is not invertible on W$ A 
{xeW'\ P To x = 0}. Since W and F' are backward extensions of W 
and F, the spaces W' Tq and W are isomorphic and F' is equivalent to F 
in the sense that F' and Fmap equivalent elements x f e W' Tq and x e PF 
into equivalent elements FV e P^ o and Fx e W. Thus Fisnot one-to- 
one and onto W if F' is not one-to-one and onto W. Hence F is not 
invertible on W as claimed. 

Theorems 2.21 and 2.22 can then be combined to give a rather 
concrete noninvertibility theorem. More specifically, they yield 

Theorem 2.23 

Let S=*[T 09 co) 9 Fe$+(W e ,W e ), and let F be invertible in 
^+(W e ,W e ) t Let W and F'e 0h(W'„W') be a backwards extension 
of W and F from S = [T ,co) to S f = (-oo,+ oo), and assume that 
for all TeS', F' has a causal inverse on W' Te A {x s W' e | F r x — 0}. 
Let F[, F^ e #-(HT,JT0 and F' = I + F^. Then F" 1 ^ JT(W,W) if 
there exists a scalar c such that - 1 $ a(cFQ in £(W',W), (I + cF^)" 1 £ 
JT+(W\W), and if (F{ - c/)F^(7 + ci^)- 1 is a contraction on JF'. 

It is again possible to make the contraction condition of Theorem 
2.23 more specific and obtain the analogues of Corollary 2.12.1, 
Theorem 2.13, Theorem 2.14, and Corollary 2.14.1. These details are 
left to the reader. Notice, however, that only the case of exterior 
conicity or outside the sector conditions on F 3 will then lead-to non- 
invertibility conditions. 

It should be emphasized at this point that it is in general not difficult 
to obtain suitable backwards extensions of particular operators. For 
time-invariant operators the obvious choice to consider is the (unique) 
backward extension which itself is time invariant. Examples illustrating 
the use of Theorem 2.23 will be given in connection with instability 
theorems in Chapter 5. 
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3 Positive Operators 



3.1 Introduction 

This chapter is devoted to the study of positive operators. An operator 
will be called "positive" if the real part of the inner product of any 
element and its image under the operation is nonnegative. For example, 
a linear transformation from a finite-dimensional real vector space into 
itself defines a positive operator if and only if the matrix associated 
with this linear transformation plus the transpose of the matrix is 
nonnegative definite. The Sylvester test yields a simple necessary and 
sufficient condition for a finite-dimensional linear transformation to 
define a positive operator. For nonlinear transformations or operators 
defined on infinite-dimensional spaces, the situation is quite different 
and this is where the techniques and results developed in this chapter 
are useful. 

Why are positive operators important ? There are several areas both in 
engineering and in applied mathematics where positive operators play 
a central role. Here are some examples : 

1. Many techniques — e.g., in the theory of optimal control, in 
prediction theory, and in stability theory — require at some point in the 
analysis that a certain function or functional be positive definite; instances 
of this are the second variations in optimization theory and Lyapunov 
functions and their derivatives in stability theory. This procedure often 
reduces to one of verifying that a certain appropriately chosen 
operator is positive. In this context, it suffices to recall how often 
the positive definiteness of certain matrices is invoked. 
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2. Another area of research where positive operators play an 
essential role is network synthesis. Recall that a ratio of polynomials in 
the complex variable s is the driving-point impedance of a two-terminal 
network that can be realized using a finite number of positive, linear, 
and constant resistors, inductors, and capacitors, if and only if this ratio 
of polynomials is a positive real function of s. This result thus identifies 
the input-output relation of these passive networks with a class of positive 
operators. 1 It leaves no doubt that positive operators will also play 
an essential role in the synthesis of nonlinear and time- varying networks 
using certain passive devices. 

3. An important application of positive operators is in establishing 
the stability of feedback systems. Roughly speaking," stability is the 
property of systems whereby small inputs or initial conditions produce 
small responses. The technique for generating stability criteria for 
feedback systems from knowledge of positive operators will be 
examined in detail in Chapter 4, but the basic idea is simple and states 
that the interconnection of passive systems (positive operators) yields 
a stable system. 

4. The so-called frequency-power formulas have been applied to the 
design of parametric amplifiers. They are formulas which constrain 
weighted sums of real and reactive powers entering a device at various 
frequencies to be either zero, positive, or negative. The device could be, 
for instance, a nonlinear resistor, inductor, or capacitor. This work 
was initiated by Manley and Rowe, who analyzed the power flow at 
various frequencies in a nonlinear capacitor and discovered the now- 
famous Manley-Rowe frequency-power formulas. Manley and Rowe's 
work has been extended in several directions and the resulting formulas 
have found application in the design of frequency converters. Frequency- 
power formulas establish fundamental limits on the efficiency of such 
devices. These formulas have been applied in energy conversion 
using parametric devices, in studies of hydrodynamic and magneto- 
hydrodynamic stability, and in many other areas. In trying to bring 
certain methods and results in these areas into harmony, it became 
apparent that these frequency-power formulas are essentially particular 
classes of positive operators and can, mathematically at least, be most 
easily understood as such. 

5. Another important application of positive operators is the 
determination of bounds on the optimal performance of nonlinear 
time-varying systems. One of the crucial problems in optimal control 

1 This relationship between positivity of operators and passivity of systems has 
motivated many authors to call positive operators passive or dissipative. 
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theory is, paradoxically, the design of suboptimal systems. In fact, 
because of computational feasibility and more convenient implementa- 
tion, it is in many cases necessary to resort to suboptimal systems. Little 
or no attention has been paid to the problem of a priori predicting how 
far a suboptimal system is from being optimal. It can be shown 2 that 
the requirement that a given system has a better performance than 
another with respect to some performance criterion can in many 
important cases be reduced to requiring that a certain suitably chosen 
operator be positive. This then allows one to estimate a priori bounds 
on the performance of certain systems and to design feasible sub- 
optimal controls. In this respect it is also worthwhile to mention that 
optimal control provides, conceptually at least, a way of verifying the 
positivity of an operator F. Indeed if inf. Re (x,Fx) > 0, then the 
operator is clearly positive. 

6. From the purely mathematical point of view, positive operators 
are important because of their implications about the invertibility of 
nonlinear operators. These aspects have been explored in Chapter 2. 

The first class of operators which are examined in this chapter for 
positivity are the convolution operators and the memoryless operators 
in which tne output is an instantaneous function of the input. These 
positive operators lead to the well-known Manley-Rowe equations and 
play an important role in stability theory since they are closely connected 
with the Popov criterion and the circle criterion for the stability of 
nonlinear and time-varying feedback systems. 

Next, attention is focused on the question what class of linear 
convolution operators can be composed with a positive periodically 
time-varying linear multiplicative gain and still yield a positive operator. 
The answer to this question turns out to require that this convolution 
operator itself be positive and that the kernel of the convolution should 
consist of a string of impulses occurring at multiples of the period of 
the time-varying gain. It is also shown that this result is the best of its 
type. 

In Section 3.2, an answer to the following question is sought: What 
is the most general linear operator that when composed with a monotone 
nondecreasing (or an odd-monotone nondecreasing) nonlinearity 
yields a positive operator? This problem has received a great deal of 
attention in the past, in connection with both the frequency-power 

2 The topic of a priori bounds of suboptimal controllers is treated in Reference 1 . 
This work brings out the relevance of positivity conditions to this important area of 
research. 
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formulas and the stability of feedback loops with a monotone non- 
linearity in the feedback loop. The resulting class of positive operators 
is closely related to certain classes of matrices, i.e., the dominant 
matrices, which are important in network synthesis. The mathematical 
reason for this connection, however, remains vague and deserves 
further investigation. As an intermediate step in deriving this class of 
positive operators a considerable generalization of a classical inequality 
due to Hardy, Littlewood and Polya on the rearrangement of sequences 
is obtained. It is felt that the extension of this rearrangement inequality 
is of intrinsic importance in itself and has applications in other areas of 
system theory. 

Section 3.8 is devoted to the problem of adjoining to a positive 
operator a causal positive operator. It is shown that it is possible to 
adjoin a causal positive operator with an arbitrary positive operator 
provided the operator admits a suitable factorization. Whether a 
particular operator satisfies this condition appears to have no general 
answer, and the problem is one of considerable interest and importance. 
Similar factorizations have received a great deal of attention in the past, 
particularly in the classical prediction-theory literature. In this section 
a general factorization theorem is presented which is felt to be quite 
general and of intrinsic importance. The result, which is based on 
contraction arguments, unfortunately does not offer a necessary 
condition and is rather conservative in some particular cases. 

Recall the definition of a positive operator: Let X be an inner product 
space and let F be a mapping from Do (F) <= X into X. Then F is said 
to be {incrementally) positive on Do (F) if for all x e Do (F) {x,y e 
Do (F)), Re (x 9 Fx) > (Re (x —y,Fx — Fy) > 0). It is said to 
be {incrementally) strictly positive on Do (F) if for some e > 0, F — el 
is (incrementally) positive on Do (F). 

Some elementary properties of positive operators are: 

1. The (incrementally) positive operators form a cone; i.e., if F ± and F 2 
are positive on Do (F x ) and Do (F 2 ), respectively, then F x + F 2 is 
positive on Do (Fj) nDo(F 2 ), and if a is a nonnegative real 
number, then aFj is positive on Do {F t ). 

2. The (incrementally) positive operators are closed under inversion; 
i.e., if Fis positive on Do (F), then F -1 is positive on Do (F _1 ). 

3. The (incrementally) positive linear operators are closed under the 
adjoint operation; i.e., if L is a linear operator from Do (L) into X 
and if L* denotes the adjoint, then L is positive on Do (L) if and 
only if L* is positive on Do (L*). 
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4. If F is positive on its domain and L is linear then L*FL is positive 
on its domain. 



3.2 Positive Convolution Operators and Positive 
Memoryless Operators 

The space under consideration in this section is L^{S) (denoted 
simply by L 2 when no confusion can occur) with H a given Hilbert 
space and S = (— ao,+ oo). The case H = R is of particular interest 
and will be the subject of a number of corollaries. As mentioned in 
Chapter 1, the space L % is a Hilbert space with (x,y) Lz A JiS (x{0> 
y{t)) H dt. Recall that the limit-in-the-mean transform of x, defined by 
£{jco) A l.i*m. JiS x{t)e" 3<ot dt, is then, for any x e L 2 , a well-defined, 
#-valued function of a>, co e R, which itself belongs to Lg{— oo,+ oo). 
Moreover, the transform of X = x jlir and Parseval's equality states 
that {x,y) L2 = 27r<i", f ) ia for all x, y e L 2 . 

Definitions: Let ^ denote the class of operators from Z, 2 into itself 
defined as follows: associated with each element G e & is a class of 
operators G{jco) e^{H,H) 9 parametrized by co s a> e R, with(j(ya>) £!«,, 
and which maps the element x e L % into the element of L % with limit- 
in-the-mean transform G{ja})Jt{jco). 

It is simple to verify that the operator G is indeed well defined; i.e., 
that it maps L 2 into itself. An important subclass of operators of this 
type are the convolution operators, which are defined as follows: let 
g(0, t g R, and g k9 k e /, be elements ot&{H,H) with {\\g(t)\\ ,{\\g k \\}) e 
L x x l l9 and let {t k } be a mapping from / into R. Let y{t) = {Gx){t) 
be formally defined by 

y{t) A 2 g k x{t -O + l g(t - t)x{t) dr. 



Lemma 3.1 

The convolution operator G formally defined above maps L a into 
itself. Moreover ffef and the function G{ja>) associated with G is 
given by 



+ 00 f+co 

G(» A 2 fit*-'"* + gWe-*" dt. 

fc=_-00 J —CO 



This is a standard result from Fourier transform theory (see Ref. 2, 
p. 90). 3 Note that G is causal on L 2 if and only if g{t ) = for almost 
all t < and t k > for all keL 

3 Lemma 3.1 remains valid if g{t) e L 2 and {g fc } E U both have limit-in-the-mean 
transforms in£«>. 
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Theorem 3.1 

Every element Ge? defines a time-invariant bounded linear 
operator on L 2? ||G|| = \\G(jco)\\ La3 , and G is a positive operator on L 2 
if and only if for almost all co e R, G{joS) is (pointwise) positive on H. 
Moreover, (7* e & 9 and has the function G*(jco) associated with it. 

Proof: The theorem is obvious with the possible exception of the 
positivity condition and the time invariance. Positivity follows from 
Parseval's equality since 



Re <x(f),Gx(*)> 



Z7T J~ oo 



Re <J?<», G(jco)Z(jco)) H do. 



Time invariance is proved as follows: IctreR, xe L 2 , and (T T x)(t) A 
x(f + r). Then Gr r x = G(ja))£(ja))e j<0T which clearly equals T T Gx. 

Corollary 3.1.1 

Let H = R and G(jco) = G(—ja)). Then G is positive on L 2 if and 
only if Re G(jai) > for almost all co > 0. 

Definition: Let Jf denote the class of operators 4 from L a into itself 
defined as follows. Associated with each element K e JT is a class of 
operators K(t)e&(H,H) 9 parametrized by /, teR, with K^eL^, 
and which maps the element xeL 2 into the element of L 2 defined by 
(Kx)(t) A K(t)x(t). From this definition there follows 

Theorem 3.2 

Every element K e JT defines a bounded memoryless linear operator 
on Z^, ||JC|| = \\K(t)\\ L<:o , and JHs a positive operator on L 2 if and only 
if K{t) is (pointwise) positive on H for almost all t e R. Moreover, 
K* ejf and has the function K*(t) associated with it. 

Corollary 3.2.1 

If H = R 9 then every element K e Jf is self-adjoint and i£Tis positive 
on L 2 if and only if K(t) > for almost all t e R. 

* The classes ^ and jf are dual in the sense that both constitute pointwise multi- 
plications; the former in the frequency domain and the latter in the time domain. 
This duality permeates the recent work on stability from an input-output point of 
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Definition: Let F denote the class of operators from L 2 into itself 
defined as follows. Associated with each element FeF is a class of 
operators F(-,0> parametrized by t, t e R, each mapping H into itself, 
such that there exists a constant M < oo such that \\F(x,t) || < M ||x|| 
for all x e H and / e jR, and which maps the element xeL a into the 
element of L 2 defined by (Fx)(t) A F(x(t),t). There follows 

Theorem 3.3 

Every element FeF defines a bounded memoryless operator on 
L 2 , \\F\\ = M\ with M' == inf M over all M such that for all x e H and 
almost all t e S 9 \\F(x 9 t)\\ < M \\x\\; i.e., ||F|| - (IIFCOII)^, and F is 
(incrementally) positive on L a if and only if F(-j) is (incrementally) 
positive on H for almost all t e R. The operator F is Lipschitz con- 
tinuous on L 2 if JF(sO is Lipschitz continuous for almost all /eS, 
uniformly in t, and ||F|| A = (IIF^OIDi^ 



3.3 Memoryless Time-Invariant Nonlinear Operators 

An important particular class of operators in F are those for which 
F(-,t) is constant for almost all t e & The operator FeF associated 
with F(',0 then becomes time invariant. Attention is now focused on 
this case, with H = R. It turns out that a number of interesting 
relations can be obtained. These results play in fact an important role 
in the design of frequency converters where they are known under the 
name of the Manley-Rowe equations and in the stability of nonlinear 
feedback systems where they lead to the Popov criterion. 

The operators under investigation are thus the time-invariant 
operators in F with H = R and S — (—00, + 00). Let this class be 
denoted by F. Each element of F has thus associated with it a map 
/(■) from the real line into itself, satisfying, for some constant M < 00 
and all a e R, the inequality |/(<r)| < M \a\, and FeF maps L 2 into 
itself according to (Fx)(t) = fix(t)). 

Definition: A function x from R into itself is said to be absolutely 
continuous if for any integer N and any sequence {t k }, k = 1, 2, . . . , N 9 

iV-l N-l 

2 I *('*) - *( Wi) I — whenever 2 I h " '*+i I — °- 

A classic result in analysis states that a function is absolutely con- 
tinuous if and only if x(t) = x(a) + p a r(t) dt for some function 
r(t) e L x (a,b). The function x(t) is then differentiable almost everywhere, 
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and r(t) = x(t) for almost all t. Let S\{aJ>) be the subspace of L % (a,b) 
formed by the functions on [a,b] that are absolutely continuous and 
that belong, together with their derivatives, to L % (a,b). Let S\ denote 
Sl(— oo,+ oo). The space S\ is an inner product space with the inner 
product as in L 2 . It is, however, not closed in the L 2 -topology. 

Lemma 3.2 
If x e Si then lim^^ x(t) = 0. 

Proof: Write JT^ x(t)x(t)dt = M* 2 <T a ) ™ * 2 (-7\)]; it follows that 
the limits exist, since the limit of the integral for 7\ or T % -^ oo exists by 
the Schwartz inequality. Since lim^^ x{t) thus exist and since 
x(t) e £ 2 , the limits must indeed be zero. 

Theorem 3.4 

Assume that F e # and that the function/ that defines F is Lipschitz 
continuous on JR. Then (x, d(Fx)jdt) L ^ = for all x e Sg. 

Proof: Let 7(0 = Fx{t) and let K be a Lipschitz constant for/. Since/ 
is Lipschitz continuous, y(t) is absolutely continuous whenever x(t) is. 
It is simple to show that \y(t)\ < K \x{t)\ whenever both exist (and thus 
almost everywhere). The inner product in the theorem statement is thus 
well defined since y e S\. Integration by parts yields 



x(t)^y(t)dt^-\ /(, 

-oo at J— co 



x(t))-x(t)dt 



rx(T) 

= — lim f(a) da 

y->oo Jx(~ 



~T) 



= 



The last equality follows from Lemma 3.2. 

Remark: If x e S\, then the limit-in-the-mean transform of x exists and 
equals jco£(jco). Thus Theorem 3.4 merely states that for all x e L z and 
y = Fx 

r+ " j(DX(-jco)Y(ja>) dco = 0, 



r 



which is precisely the Manley-Rowe power-frequency formula 5 for 
elements of L 2 . 

5 The Manley-Rowe equation thus merely states that the vectors £ and Fx are 
orthogonal in £ 2 . 
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Combining Theorems 3.3 and 3.4 yields 



Theorem 3.5 

Let FelF, and assume that the function / which determines F is 
Lipschitz continuous on R. Let cceR and G e ^ be defined by G(jco) = 
1/(1 + a/co). Then FG is a nonnegative operator on L 2 if and only if 
for all a eR, of (a) > 0. 

Proof: The theorem is a particular case of Theorem 3.3 if a = 0. 
Let therefore a^O, Since the operator G corresponds to a convolu- 
tion, Gx is absolutely continuous for all x e L 2 . Moreover, since 
jcoftl + ajoti) e L OT for a 5* 0, Gx e Si for all x e L 2 . Thus 



(x,FGx) 



=<(-*!) 



Gx,FGx 



\ 



d 
dt 



= (Gx,FGx) + <x( — Gx,FGx ) >< — Gx,FGx 



<> 



d 
dt 



\ 



This last inner product equals zero by Theorem 3.4. The converse part 
of the theorem is an immediate consequence of Theorems 3.3 and 3.4. 



3.4 Periodic Gain 

The result in this section concerns a positive operator formed by the 
interconnection of a periodically time-varying gain and a linear time- 
invariant convolution operator. The proof is very simple, and this 
positive operator leads to a rather elegant frequency-domain stability 
criterion that will be discussed in Chapter 6. 

Definitions: Let T be a positive real number, and let Jf T denote the 
subclass of JT determined by functions k{t) which satisfy k(t + T) = 
k(t) for almost all t. Let @ T denote the subclass of elements of ^ 
determined by functions G(jco) which satisfy G(J(oj + 2ttT~ 1 )) = G(jco) 
for almost all 00. 

It is again assumed that H = R. Each element of X~ T thus has 
associated with it a real- valued function k(t) defined on (— oo, + oo), 
with k(t + T) = k(t) for almost all / e R, k(t) e L^ % and which maps 
L % into itself according to (Kx)(t) = k(t)x(t). An operator KeJT T 
thus corresponds to a periodic gain with period T. Each element of *$ T 
has associated with it a complex- valued function G(jco) = G(-~jco) 
defined for cdgR, with G(j(a) + 2irT~ x )) = G(jto) for almost all co, 
G(]cd) g L^, and which maps the element x e L a into the element with 
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hmit-in-the-mean transform G(jco)%(ja>). An important subclass of 
operators in <& T are those defined by the convolution with kernel 
Xneign d (t - nT )> where {g n }el u and d denotes the Dirac delta 
function. 



Lemma 3.3 

Let Ke Jf T and Ge^ r . Then K and G commute on L 2 , i.e., 
KGx = GKx for all x e L 2 . 

Proof:* Since both K and G are bounded linear operators from L % into 
itself, KG and GK are. By continuity of bounded linear operators, it 
thus suffices to prove the lemma for a dense set in L 2 . Define the 
sequence {g n }, nel, by 

g„A- L G(Ja>)-»" T da>. 

2tt Jo 

It follows from the theory of Fourier series that {g n } e / 2 and that 



+ 00 



+N 



jncoT 



G(» = l.i.m. 2 S n e inaT = Hm 2 ^ 
Let i? be any element of L 2 n L a . Then 

ww(0 = 2 gX* - nT) eL 2 n L x . 

Let V and Jf^ denote the limit-in-the-mean transforms of v and w N 
respectively. Then 






Thus 



Since P(ja>) e L^ and ||^(p)|| ioo < IM^, it follows from Holder's 
inequality that 



jnmT I! 



\\W N (jco) - GUo>)V{jm)\\ Li < Mi* \\G(jco) - £ g»e : 

6 The conclusion of Lemma 3.3 is immediate if one is satisfied with the following 
formal argument: since Gx(t) = 2«=-<*> 8^* ~ nT ) and k ^ = *(' ™ nT ^ then 

^G^(f) - *(f) J g n x(t - »T) = 2 £*«*<' " b7 >C ~ wT > = GA*(f). 

n=_oo «=— oo 
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Since 2^— ivg , * fi,n<ar approaches G(yco) in the L 2 sense, this shows 
that w N approaches in the L % sense the function whose limit-in-the-mean 
transform equals G(ja))V(j(o). Hence 

+ 00 . 

w(t) = l.i.m. 2 S n v(t - nT) 

n=—oo 

exists, belongs to L a , and has G(jco)f(jco) as its limit-in-the-mean 
transform. This holds for all v e L % n L v The lemma will now be 
proved for all xeL z C\ L x . Since then Kx e L % n L u the above 
analysis applies to both x and Kx. However, k(t)g n x{t - nT) = 
g n k(t - nT)x(t - nT) for all nel, and thus 

N N 

Kt) 2 *•*(* - «T) = 2 SnKt - nT)x(t - nT), 

which, after taking the limit-in-the-mean of both sides and observing 
that k e L m% yields KGx = GKx for all x e L 2 n Lj. Since L 2 n L x is 
dense in L a , the lemma follows. 

Definitions: An operator 2? from Z, 2 into itself is said to possess a 
square root, denoted by.F 1/2 , if there exists an operator, F m , from L % 
into itself such that F = F 1 '^ 1 '*. 

Lemma 3.4 

Let Ke Jf be determined by k(t), and assume that for almost all 
* e R, k{t) > 0. Then K 1/2 exists. Moreover, K 1/z e jf and K l/2 e Df T 
ifKeJf T . 

Proof: The element of JT determined by k(t) 1/2 possesses all the 
required properties. 

Theorem 3.6 

Let K e Jf r and let G e 9^. Then JK7 and (?,£ are (strictly) positive 
operators on L 2 if for almost all teR and o> > 0, k{t) > and 
Re G(jco) > (for some e > 0, &(0 > e and Re ^(yco) > e). Moreover, 
the elements of @ T which satisfy the above inequality are the only 
elements of ^ which yield (strictly) positive operators KG and GK 
for all (strictly) positive elements K e 3f y . 

Proa/; The first part of the theorem follows from Lemma 3.3 if it is 
proved for KG. But by Lemmas 3.3 and 3.4 

<jc,1BGx> = (x,K w GKV*x). 
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Since K 1/2 eJf.it is self-adjoint, and thus 

(x,KGx) = (K^^xfiK^x), 

which is positive by Theorem 3.1. To prove the strict positivity condi- 
tion, write KG as KG — (K — d)G + *G and apply the previous part 
of this theorem and Theorem 3.1. For the converse part of the theorem, 
assume first that Re G(jco) < for all w in a set of positive measure. 
The result then follows by contradiction if one chooses K~L Assume 
next that Re G(jco) > for almost all co, but that 

G(j(co + IttT- 1 )) - G(jco) y* 

for all co in a set of positive measure, say Q. This part 'of the theorem 
is then proved by choosing particular functions for k(t) and x(t) 
which lead to (x,KGx) < 0. For simplicity assume that 

Re (G(joj) - G(j(co + 27rr~ 1 )) < 

on the set £1 (A similar argument holds for the other cases). Then there 
exists an e > such that Re (G(jco) - G(j(co + 2tt7^ 1 ))) < -e for all 
co e £2' with ii' c= Q a set of positive measure. Let Q!' n be a subset of 
[hIttT- 1 , (n -f l)27tT- x ] n Q' which is of positive measure (such a 
subset exists for some w, since O' is of positive measure). Let Q!' n -h 
klirT- 1 denote the set of points x e R such that x — k2irT~ 1 e O*. 
Choose X(jco) = 1 for weQ'^ k2<rrT-\ kel, and |A:| < N, and 
X(jco) = otherwise, and choose k(t) = 1 — cos 2trT~H. Clearly, 
kit) > and the K corresponding to k{t) belongs to K T . Let y = KGx. 
Then 

f(yo)) = G(jco)X{jto) - iG(;(o> + 27rr- 1 ))l0'(^ + IttT' 1 )) 

- \G{j{co - 2irT^))1t{j{o> - 2^7^)). 
A simple calculation shows that the inner product (x,KGx) becomes 
M + (Jtyr) Re (G(jgi) - G(j(o> + 2^T^)))^K\ 

with M a number independent of N 9 and yt*(0^) the Lebesgue measure 
of £l" n . Thus (x y KGx) can be made negative by choosing iV sufficiently 
large. 7 

7 Similar positive operators involving a time-varying gain have appeared in the 
literature. In particular the positive operator obtained by Gruber and Willems 
(Ref. 3), and in its full generality by Freedman and Zames (Ref. 4), appears to be 
especially interesting. By restricting the derivative of k(t), they obtain a class of 
operators which can then be composed with K without destroying positivity. 
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3.5 Positive Operators with Monotone or 
Odd-Monotone Nonlinearities 

In this section an answer is given to the following question: What 
is the most general linear operator that when composed with a monotone 
nondecreasing (or an odd-monotone nondecreasing) nonlinearity 
yields a positive operator? The answer to this question is the solution 
to a problem which has been studied by many previous researchers. 8 

The preliminary result obtained in this section constitutes a con- 
siderable extension of a classical rearrangement inequality. This 
inequality then forms the basis from which the positive operators are 
derived. It is felt that these rearrangement inequalities are of intrinsic 
importance and that they will be useful in other areas of system theory. 
For various technical reasons, the discussion is mainly concerned with 
sequences. With some modifications, similar results can be obtained 
for the continuous case. Hence, these will merely be stated without 
proof. 

3.5.1 Generalizations of a Classical Rearrangement Inequality 

Chapter 10 of Hardy, Littlewood, and Polya's classic book on 
inequalities (Ref. 15, p. 277) is devoted to questions relating the inner 
products of real-valued similarly-ordered sequences to the inner 
products of rearranged sequences. The simplest result given there 
states that if x x > x z > ■ • • > x n and j x > y z > • • • > y n and if 
y 9 {iu y*&h • • - >y*(n) is any rearrangement of the ^-sequence then 

n n 

z, x kyh > z x *yvm 

The informal explanation of this fact given there is that, given a lever 
arm with hooks at distances x l9 x z , . . . , x n from a pivot and weights 
Ji> y*> • ■ • , y n to nan g on tne hooks, the largest moment is obtained by 
hanging the largest weight on the farthest hook, the next largest weight 
on the next most distant hook, etc. 

This result has an interpretation in terms of positive operators. 
Suppose that/is a function from R into itself, and denote by x and Fx 

8 In particular it is the problem studied by Page (Ref. 5), Pantell (Ref. 6), and 
Black (Ref. 7) in connection with frequency-power formulas and it plays a central 
role in the determination of stability criteria for feedback systems with a monotone 
or an odd-monotone nonlinearity in the feedback loop. In the latter context it has 
been treated by Brockett and Willems (Ref. 8), Narendra and Neumann (Ref. 9), 
Zames (Ref. 10), O'Shea (Refs. 11, 12), O'Shea and Younis (Ref. 13), Zames and 
Falb (Ref. 14), and others. 
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the /z-vectors whose components are respectively x l9 x 2 , . . . , x n and 
f(xi)>f(x*)> • • • >/(*«)• Then in the language of positive operators the 
Hardy, Littlewood, and Poly a rearrangement theorem states that the 
operator N on R n defined by Nx A (/ — P)Fx is positive if / is 
the identity matrix, P is any permutation matrix, and / is monotone 
nondecreasing. 

It will be shown that this result together with a result of Birkhoff on 
the decomposition of doubly stochastic matrices permits the derivation 
of a number of interesting positivity conditions for a class of operators. 9 
The results thus represent a test for checking the positivity of a class of 
nbnquadratic forms parallel to the Sylvester test for checking the 
positive definiteness of a symmetric matrix. 

Definitions: Two sequences of real numbers {x l9 x %9 . . . x n } and 
{ji» J29 • * • J«} are sa id to be similarly ordered if the inequality x k < x x 
implies that y k < y t . Thus two sequences are similarly ordered if and 
only if they can be rearranged in such a way that the resulting sequences 
are both monotone nondecreasing; i.e., there exists a permutation rr(k) 
of the first n integers (^(k) takes on each of the values 1, 2, . . . , n just 
once as k varies through the values 1, 2, . . . , n) such that both the 
sequences {x H1)9 x wi2)9 ... 9 *.<„>} and {;;,<!,, j^a,, . . . , y n{n) } are mono- 
tone nondecreasing. Two sequences are said to be unbiased if x k y k > 
for 1 < k < n. Clearly two sequences are similarly ordered and un- 
biased if and only if the augmented sequences {x l9 x a , . . . , x n9 x^ +1 } 
and {y l9 y% 9 - . . , y n > y n +i} with x n+1 = y n+l = are similarly ordered. 
Two sequences are said to be similarly ordered and symmetric if they are 
unbiased and if the sequences {\x x \ 9 \x % \ 9 . . . , \x n \} and fl^l, \y 2 \ 9 . . . , 
\y n \} are similarly ordered. 

As an example, let f(a) be a mapping from the real line into itself, 
and consider the sequences {x l9 x a , . . . x n } and {f(x 1 ) 9 f(x % ) 9 . . . , 
f(x n )}. These two sequences will be similarly ordered for all sequences 
{x l3 x z , . . . , x n } if and only if/(cr) is a monotone nondecreasing function 
of a, i.e., if for all a x and c 2 , (a t — ^(/(^i) — f( a z)) > 0- They will be 
unbiased if and only iff(o) is a. first and third quadrant function; i.e., 
if for all <r 9 of (a) > 0. They will be similarly ordered and symmetric 
if and only if f{o) is an odd monotone nondecreasing function of a, 
i.e., if/Co) is monotone nondecreasing and/(cr) = — /(— a) for all <r. 

9 The Hardy, Littlewood, and Polya rearrangement inequality leads rather easily 
to a proof of the fact that the cross correlation of the input and the output to a 
monotone nondecreasing nonlinearity attains its maximum value at the origin. 
This approach has in fact been used by Prosser (Ref. 16) and Black (Ref. 7). 
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Definitions: A real (n x n) matrix M = (m kl ) is said to be doubly 
hyperdominant 10 with zero excess ifm kl < for k 5^ /, and if 2&=i m u — 
2r=i m Jci = f° r a ^ ^> '• ^ * s sa ^ to b e doubly hyperdominant if 
m kl < for k # /, and if 2a=i ^u > and 2/=i m ki > for all k, L 
An (n x n) matrix M is said to be doubly dominant if m n > 2*=i *** \ m u\ 
and m kk > 2f=u^fc \*n kl \. It is clear that all of the classes of matrices 
introduced above are subclasses of the class of all matrices whose 
symmetric part is nonnegative definite and that every doubly hyper- 
dominant matrix is doubly dominant. 

Two other classes of matrices that will be used in the sequel and have 
received ample attention in the past are now defined. 

A (n x n) matrix M is said to be doubly stochastic if it is nonnegative 
(i.e., m kl > for all k 9 I) and if each row and column sums to one. A 
(n x n) matrix is said to be a permutation matrix if every row and 
column contains n — 1 zero elements and ah element which equals 1 . 
The relation between the class of doubly stochastic matrices and 
permutation matrices is given in the following lemma due to Birkhoff. 

Lemma 3.5 

The set of all doubly stochastic matrices forms a convex polyhedron 
with the permutation matrices as vertices; i.e., if M is a doubly 
stochastic matrix then M = 2^i a ^* w ^h a t > 0» 2£=i a * = * > aiM * 
P i a permutation matrix. This decomposition need not be unique. 

A proof of Lemma 3.5 can be found in most books on matrix theory 
(see, e.g., Ref. 20, p. 97). 

Theorem 3.7 states the main result of this section. As mentioned 
before, it constitutes a considerable generalization of a classical 
rearrangement inequality due to Hardy, Littlewood^ and Polya. This 
inequality is stated in Lemma 3.6. 



Lemma 3.6 

Let {x l9 x 2 , . . . , x n } and {y l9 y %9 ■ • * > 7«} be two similarly ordered 
sequences, and let 7r(k) be a permutation of the first n integers. Then 

2fc=l X kyie > Z,fe=l X feJ ff (ft)- 

10 The term "dominant" is standard. "Hyperdominance" is prevalent, at least in 
the electrical network synthesis literature. The term "doubly" is used by analogy 
with "doubly stochastic" where a property of a matrix also holds for its transpose. 
Beyond this the nomenclature originates with the author. Symmetric hyperdominant 
matrices are sometimes called Stieltjes Matrices. For an interesting application of 
these matrices to the resistive iV-part problem, see Reference 17. For some general 
theorems on matrices of the type under consideration, see References 18, 19. 
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A simple proof of Lemma 3.6 can be found in Hardy, Littlewood, 
and Polya's book (Ref. 15, p. 277). A convincing plausibility argument 
is given in the introduction to this section. 

Theorem 3.7 

A necessary and sufficient condition for the bilinear form 
^Jk i x=\ m hi x kyi to be nonnegative for all similarly ordered sequences 
{x l3 x 2 > • • • x n } and {y l9 y 2 , . . . y n ) is that the matrix M = (m kl ) be 
doubly hyperdominant with zero excess. 

Proof of sufficiency: Let Mbea doubly hyperdominant matrix with 
zero excess and let r be any positive number such that > > m kl for all 
*, /. Clearly M = r[I - (l/r)(r/ » M)]. Since however (l/r)(r/ - M) 
is a doubly stochastic matrix, it can, by Lemma 3.5, be decomposed as 
2i!=i <*-iPi with a,- > 0, 2£o a, = 1 , and the P/s permutation matrices. 
Thus M can be written as M = 2ti A(/ - A) with & > 0. This 
decomposition of doubly hyperdominant matrices with zero excess 
shows that it is enough to prove the sufficiency part of Theorem 3.7 for 
the matrices / — P^ This, however, is precisely what is stated in 
Lemma 3.6. 

Proof of necessity : The matrix M may fail to be doubly hyperdominant 
with zero excess because m kl > for some k # / in which case the 
sequences with n — 1 zero elements except + 1 and — 1 in respectively 
the &th and /th places lead to 2fc,/=i m u x kyx = ~ m ki< 0- Assume next 
that the matrix M fails to be doubly hyperdominant with zero excess 
because 2*=i *w** ^ f° r some ' ( a similar argument holds if 2?Li '"jm ^ 

for some k), and consider the similarly ordered sequences {1, . , . , 1, 

1 + €, 1, . . . , 1} and {0, . . . , 0, er 1 , 0, . , . , 0} with e ^ 0, and the 
elements 1 + e and e" 1 in the /th place. This leads to 2 fc Uml m kl x k y t — 
e" 1 2?=i Mm + m iv By taking e sufficiently small and of an appropriate 
sign, ^ t i ==l m kl x k y l can then indeed be made negative. 



The following two theorems are generalizations of Theorem 3.7 to 
similarly ordered unbiased and to similarly ordered symmetric sequences. 

Theorem 3.8 

A necessary and sufficient condition for the bilinear form 
lLk t i=i m ki x kyi to b e nonnegative for all similarly ordered unbiased 
sequences {x l9 x % , . . . , x n } and {y Xi y^ . . . , y n ) is that the matrix 
M = (m kl ) be doubly hyperdominant. 
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Proof of sufficiency: Let M be a doubly hyperdominant matrix and 
define m kn+1 = -2?=i w w» m n+i t i = -2£=i m *i fo * K / < «, and 
»Wi,ii+i = 2*.i=i "%• Then taking x 7H . 1 = j w+1 = it follows from 
Theorem 3.7 that ^ti^m^^z « B (W m t ^, > since the aug- 
mented (n + 1 x ri + 1) matrix M* = (»%), fc, / = 1 , 2, . . . , « + 1 is 
doubly hyperdominant with zero excess and since the sequences 
{xi, x z , . . . , x n9 x n+1 } and {y l9 y z , . . . , ;/„, y n+1 } with x n+1 = j n+1 = 
are similarly ordered. 

Proof of necessity : The same sequences as in Theorem 3.7 can be used 
if the matrix M fails to be doubly hyperdominant because m kl > for 
some k ^ I. Assume next that the matrix M fails to be doubly hyper- 
dominant because Jj^ m kl < for some / (a similar argument holds 
if 2r=! m ki < for some A:), and consider the sequences used in 
Theorem 3.7 with the additional restriction that e > 0. Notice that 
these sequences are then similarly ordered and unbiased. It follows that 
by taking e sufficiently small ^li^m^x^ = ^Xk^x^i + ™u can 
thus be made negative. 



Theorem 3.9 

A necessary and sufficient condition for the bilinear form 
2fc, z =a WkiXkyt to be nonnegative for all similarly ordered symmetric 
sequences {x u x Z9 . . . , x n } and {y u y Z9 . . . ,y n } is that the matrix 
M ~ (m kl ) be doubly dominant. 

Proof of sufficiency: Let M be a doubly dominant matrix. Clearly, 

n n n 

2 rn*ix*yi > 2 rn kl \x k \ \y t \ - 2 l^ill**! Ittl. 

k.l=l fc,Z=l fc,Z=l 

k^l k*l 

The right-hand side of the this inequality is nonnegative by Theorem 
3.8, since the matrix M* = (m kl ) with m\ % — 



m h 



m 



when k ~ I and 
,^_ —\m kl \ when k ^ / is doubly hyperdominant and since the 
sequences {\ Xl \, [x 2 |, . . . , \x n \} and {IjKiU l^l, -•* , W> are similarly 
ordered and unbiased. This implies that ^Ji.i=xm u x k y l > 0. 

Proof of necessity: Assume that the matrix M fails to be doubly 
dominant because m u — 2£-i,fc*i \ m ki\ < ° for some / (an analogous 
argument holds if m kk — 2?=i.z?*& l m fc«l < ° for some *0» and consider 
the sequences {-sgn m ll9 . . . , — sgn iw w> „ 1 + e, -sgn w J+1>z , 
— sgn m nl } and {0, . . . , 0, e" 1 , 0, . . . , 0} with sgn oc = a/|a| if a ^ 0, 
sgn = 0, e > 0, and the elements 1 + e and e" 1 in the /th places. 
These sequences are similarly ordered and symmetric and lead to 
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2*.j-iW M x*yi = c-Kwiw - 2jU.**i |w w |) + w„, which, by taking € 
sufficiently small, yields 2*,*=i m hi x kyi < 0. 

Let/be a mapping from 7? into JR and denote by Fthe mapping from 
R n into itself which takes the element col (x l9 jc a , . . . , x n ) into 
col (/(xj, /0c a ), • • • > /(**))■ Then in terms of positive operators, 
Theorems 3.7-3.9 become 

Theorem 3.10 
Let M be an (n X n) matrix and let /be one of the following: 

1. A monotone nondecreasing function. 

2. A monotone nondecreasing first and third quadrant function. 

3. An odd-monotone nondecreasing function. 

Then MFis a positive operator on R n for all mappings /satisfying one 
of the conditions 1-3 if and only if M is, respectively, 

1. A doubly hyperdominant matrix with zero excess. 

2. A doubly hyperdominant matrix. 

3. A doubly dominant matrix. 

3.52 Extension to l 2 -summable Sequences 

In this section l^ is taken over the field of real numbers unless 
otherwise mentioned. 

Definitions: Let JS? (4,4) denote all bounded linear transformations 
from 4 into itself. Let R e J2?(/ a ,/ a ). Then R determines an array of real 
numbers {r kl }, k,lel, such that y = Rx is defined by y k — "2t2L« r kl x z 
for x = {x k } and j = {y k }, k e I (Reference 21, p. 50). This infinite 
sum exists for all x e 4 and the resulting sequence belongs to / a . A 
standard result in the theory of bounded linear operators in Hilbert 
space (Ref. 21 , p. 52) states that the array {r*,}, k, I e /corresponding to 
the adjoint R*, is given by r* x = r lk for all l,ksL It is not known what 
arrays determine elements of J?(! Z ,I 2 ). The following lemma, however, 
covers a wide class. 



Lemma 3.7 

Let the array {r kl }, k, I e /, be such that the sequences {r kl } belong to 
4 for fixed k and /, uniformly in k and /; i.e., there exists an M < go such 
that 2t-co k„| < M and 2£^-« |r w | < M. Then {r kl } determines an 
element R of J2?(/ a ,/ a ) and ||jR|| < M. 
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Proof: The Schwartz inequality and Fubini's Theorem for sequences 
(Ref. 27, p. 245) yield the following inequalities: 

(+oo I +oo 2\l/2 r +co / +co \2-jl/2 

2 I'** I < 2 I 2>»ii*,i) 
*=— OO j l=— OO / |_&=— 00 \f=— 00 / J 

[+00 / +00 \ / +00 M 1 /^ 

2 2\r»\)( 2>«ll*.n 
fc=-co\l=-co / \Z=-00 /J 

(+oo \l/2 



<M 



which proves the lemma. 

In what follows an important role will be played by some particular 
elements of J§? (4,4) and some particular sequences which will now be 
introduced. 

Definitions: The definitions of similarly ordered, similarly ordered 
unbiased, and similarly ordered symmetric infinite sequences are 
completely analogous to those for finite sequences and will not be 
repeated here. It is easy to show that two sequences in 4 are similarly 
ordered if and only if they are similarly ordered and unbiased. Let M 
be an element of J£?(4,4X and let {?n fc J, k, I e /be the associated array. 
M is said to be doubly hyperdominant if m kl < for k # / and if 
2fc=-oo m u and 2j^°-°° m hi ex i st an <* are nonnegative for all / and k. M 
is said to be doubly dominant if 



+ 00 

™u> 2 \ m ki\ and 

ft=-00 



> 2 l™& 



It is clear from Lemma 3.7 that if an array or real numbers {m kl }, k, 
lei, satisfies the doubly dominance condition and if the sequence 
{m^e^, then {m kl } determines an element M of jS?(/ a ,/ a ) with 
||M|| < 2 sup^ m kk . Thus it is a relatively simple matter to check 
whether or not an element of 3? (4,4) is doubly hyperdominant or 
doubly dominant. 

The following extensions of Theorems 3.8 and 3.9 hold: 

Theorem 3.11 

Let M be an element of -S?(4>4). Then a necessary and sufficient 
condition for the inner product (x,My) t2 to be nonnegative for all 
similarly ordered unbiased 4" sec l u ences x and y (similarly ordered 
symmetric 4-sequences x and y) is that M be doubly hyperdominant 
(doubly dominant). 
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Proof: It is clear that all finite subsequences of x and y are similarly 
ordered and unbiased or similarly ordered and symmetric. Hence, by 
Theorems 3.8 and 3.9 all finite truncations of the infinite sum in the 
inner product (x,My) yield a nonnegative number. Thus the limit, 
since it exists, is also nonnegative. 



Of particular interest are the arrays {r kl }, k, lei, for which the 
entries depend on the difference of the indices k and / only. These 
arrays are said to be of the Toeplitz type and have been intensively 
studied in classical analysis (Ref. 23). It follows from Lemma 3.7 that 
if the array {r kl — r k _^, k, I si, is of the Toeplitz type then it de- 
termines an element of ^(4,4) if {r k }, k el, belongs to 4. (In fact, the 
elements of JS?(4»4) for which the associated array is of the Toeplitz 
type stand in one-to-one correspondence to all / 2 -summable sequences 
whose limit-in-the-mean z-transform belongs to L^ for \z\ = 1.) An 
element of j£?(4,4) is said to be of the Toeplitz type if the associated 
array is of the Toeplitz type. An element R of J§?(4,4) which is of the 
Toeplitz type thus determines a sequence {r k }, k s I, with {r k } e 4 and 
whose limit-in-the-mean z-transform belongs to L^ for \z\ = 1. The 
importance of these linear transformations stems from the fact that they 
define convolution operators with a time-invariant kernel and are 
therefore associated with time-invariant systems. 11 

Definitions: A sequence of real numbers {a k }, kel, is said to be 
hyperdominant if {a k } e l u if a k < for all k ^ 0, and if 2* *-«> #& > 0- 
It is said to be dominant if {a k } e 4, and if 2a > 2fc~«> \ a k\- 



Theorem 3.12 

Let M be an element of J£f (4,4) which is of the Toeplitz type. Then a 
necessary and sufficient condition for the inner product (x 9 My) to be 
nonnegative for all similarly ordered unbiased / a -sequences jc and y 
(similarly ordered symmetric / 2 -sequences x and y) is that the sequence 
{m k }, kel, which is determined by M be hyperdominant (dominant). 



Theorem 3.12 is a special case of Theorem 3.11. Theorems 3.11 and 
3.12 have an obvious interpretation in terms of positive operators. 
Moreover, Theorem 3.12 yields some simple properties of the input 

11 A definition of time invariance for operators defined on the continuous real line 
has been given in Chapter 2. The extension of the definition to operators defined with 
the integers as the time-interval of definition is obvious. It is then a simple matter to 
verify that an element RS JSP(4>M is of the Toeplitz type if and only if R is time 
invariant. 
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and the output spectra to (odd) monotone nondecreasing nonlinearities. 
This is stated explicitly in Theorem 3.13. 

Definitions: Let s£ denote the class of operators from / 2 into itself, 
each element of which has associated with it a function A(z), with 
A(z) e L^ for \z\ = 1, A{z) = A(z), and which maps the element x of 
/ 2 as follows: let X denote the limit-in-the-mean z-transform of x. The 
image of x is the sequence y whose limit-in-the-mean z-transform 
equals A(z)$(z). 

Let F denote the class of operators from / 2 into itself, each element of 
which has associated with it a function /(a) from R into itself, which 
satisfies the inequality |/(c)| < M \a\ for some M < oo and all a e R 
and which maps the sequence x = {x h }, kel, of / 2 into the sequence 
7 = {jJwith i y fc A/(A: fc ). 

It is a simple matter to verify that these operators are indeed well 
defined, i.e., that they map / a into itself. The class si stands in one-to- 
one correspondence with all / 2 -sequences whose limit-in-the-mean 
z-transform belongs to L^ for \z\ = 1. Moreover if {a k }el % and 
A(z) e L^ for \z\ — 1 are such a sequence and its limit-in-the-mean 
z-transform, then the element of s& which has the function A(z) 
associated with it maps / a into itself according to the convolution 

24- co 

Theorem 3.13 

Let Aes$ and F e $F \ Then AF is a positive operator on / 2 if the 
following statements are true: 

1. The / corresponding to JF is a (odd) monotone nondecreasing first 
and third quadrant function. 

2. The inverse z-transform of A(z) is (dominant) hyperdominant. 

Moreover, the elements of s/ satisfying statement 2 are the most 
general elements of si which yield a nonnegative operator AF on / 2 , 
for any FsF satisfying statement 1. Finally AF is a strictly positive 
operator on / 2 if A — eJ and F — el satisfy statements 1 and 2 for 
some e > 0. 

The proof of Theorem 3.13 follows from Theorem 3.12. Theorem 
3.13 states that if Jf(z) and Y(z) are the limit-in-the-mean z-transforms 
of the input and the output of a (odd) monotone nondecreasing 
nonlinearity then 






A(z)£(z)?(z)dz>0 
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for any A(z) which is the z-transform of a (dominant) hyperdominant 
sequence. 



3.6 Frequency-Power Relations for Nonlinear Resistors 

Jn this section a class of positive operators formed by the composition 
of a linear time-invariant convolution operator and a (odd) monotone 
nondecreasing nonlinearity will be derived. The analysis is done for 
operators on L 2 , but the results are also stated for almost periodic 
functions — thus placing the positive operators obtained in this section 
in the context of the classical frequency-power relations for nonlinear 
resistors. 

Definitions: Let Jt denote the class of operators from L 2 into itself 
each element of which belongs to the class $F and for which the asso- 
ciated function/is monotone nondecreasing; i.e., {a x — o^X/fai) — 
/(°a)) > ^r all a l9 a 2 e R. Let 3? denote the class of operators from 
L 2 into itself each element of which belongs to Jf and for which the 
associated function/is in addition an odd function; {.$.,/(&) = — /(— a) 
for all a e K 

Let x lt x % e £ 2 . Then x % {t + r)eL 2 for all r e R, and \\x 2 (t + r)\\ = 
II **(/)!!• The crosscorrelation function of x x and x z is defined as the 
function R XlX2 (r) = {x x (t), x % (t + r)>. Note that the Schwartz inequality 
yields \R Xx<Cz (t)\ < HxJ ||* a ||. Moreover, since the limit-in-the-mean 
transforms of x(t) and x(t + t) are given by J?(/eu) and ${ja>)e j< * r 
respectively, it follows from Parseval's relation that 



K-iiM « T r^Xii-J^Ujude^ dm. 

ZTTJ-ao 



The above crosscorrelation-function inequality states in particular 
that 

IH^MI < 1^(0) = \\x.\W 

The theorem which follows is a generalization of this well-known 
property of autocorrelation functions. It states that the crosscorrelation 
function of x and y attains its maximum at the origin provided x and y 
are related through a monotone nondecreasing nonlinearity. 12 

12 This property follows rather easily from the Hardy, Littlewood, and Polya 
rearrangement inequality. This was pointed out by Prosser (Ref. 16) and Black 
(Ref. 7). The inequality R^ifl) > K*w(0 + **»(-')) is easy to obtain and holds for 
any x and y which are related through an incrementally positive time-invariant 
operator. This inequality in turn implies incremental positivity and time invariance. 
It is not clear, except in the scalar case, precisely what conditions on F are necessary 
and sufficient to insure that i? w (0) > ^(f )(&»(()) > 1^(01) for any x and y = Fx. 
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Theorem 3.14 

Let F^Jt, xe L % and let y = Fx. Then 7^,(0) > R xy (t) for all 
/ e R. If F belongs to Sf \ then R xy (0) > \R m (t)\ for all t e R, 

Proof; Let F(a) = llf{x) dx. Then F(<x) is a convex function of a 
(since the derivative of F{a) exists and is monotone nondecreasing). 
The convex function inequality (Ref. 24) yields that (a x - o^f(a^ > 
F{ a i) — F( a %) for all a l5 c 2 g i?. (This inequality can simply be 
obtained by integrating f{a) — f(o x ) from a ± to a 2 .) Taking a x = 
x(/ + r) and c 2 ^ *(0 it follows that 

(x(t) - x(f + r))y(t) > F(x(0) - F(x(t + r)), 

which after integration yields 

&JP) ~ *«to > r°°F(x(t)) dt - r°°F(x(/ + r)) dt = 0. 

■/ — 00 J— 00 

The integrals on the right-hand side exist since by assumption F e Jt 
and thus \f(<r)\ < K \a\ for some K and all a e R, which implies that 
\F(a)\ < K |<r| 2 for all a e R. Hence i?^(0) > R xy (t) for all F e Jt and 
f e i?. If /is in addition odd, then the convex function inequality can 
be rewritten as {a x - (— <r a )]/(0i) > F(o^ — F(-or 2 ), which, since 
/is odd, yields that (o\ + <r a )/(oi) > F(a x ) - F(cr 2 ). Exactly the same 
argument as above then leads to R ay (0) + R xy (t) > for all t e R, 
Thus, 7*^(0) > \R xy (t)\ for all Fe & and f 6 i*. 

Remark: It can be shown (Ref. 25) that Theorem 3.14 is also sufficient 
in the sense that if y = Fx for some Fe/ and if R xy (0) > 7^(0 
CKtfv(O) > \R xy (t)\), for all x e L 2 and * g i?, then FeJt{&). 

Theorem 3.15 

Let F e ^(50 and let G e ^ be determined by the function G(;o)) 
given by the Fourier-Stieltjes integral 



GO") 



-GO 



d7(r) 



where V(r) is any monotone nondecreasing function (any function of 
bounded variation) of total variation less than or equal to unity. Then 
GF is a nonnegative operator on L 2 . 

Proof: Assume first that F e Jt. The theorem follows from the previous 
theorem if it is noted that 7^(0) > and that the operator G is 



68 POSITIVE OPERATORS 



defined by the convolution 

/N-oo 

y(t) - (Gx)(0 = x(t) -j^ x(t - r) dV(r). 

Indeed, let y = Fx. Thus 

(x,Gy) = c^(0) + J1S [1^(0) - R xy (r)] dV{r\ 

where c 2 = 1 — the total variation of V. Note that the above integrals 
exist, since R^ is bounded and since V is of bounded total variation. 
Thus (x,Gy) = {x,GFx} > by Theorem 3.14. The oddnnonotone 
case is proved in a similar way. 

Remark: GF will be a positive operator on L a if F — el e ^{6^) for 
some € > and if the total variation of V is strictly less than unity. 

Theorem 3.16 

Let Fand G satisfy the conditions of Theorem 3.15 and assume that 
the function /which determines F satisfies a Lipschitz condition on R, 
Then (G + fl d\di)F is a nonnegative operator on S\ for all a e R. 

The proof of Theorem 3.16 follows from Theorem 3.4 and 3.15. 
Theorem 3.16 states that if It and f are the limit-in-the-mean trans- 
forms of x and y — Fx with x and F as in Theorem 3.16, then 



/: 







M x {joi)X{-i<^)Y{joS) dm 
for all functions M^jca) given by the Fourier-Stieltjes integral 
Mx(» = 1 + a/to - I"*" \- ie " dVy.{r), 

J— 00 

where a e R and V^r) satisfies the conditions of Theorem 3.15. 

There is one possible refinement of this result that has, however, no 
immediate interpretation in terms of positive operators unless additional 
smoothness assumptions are made on x. Consider the functions of the 
form 13 

e*™ - ja>Tg(T) 



M 2 (yo>) 



J— 00 



dV(r), 



13 Functions of this type have been studied in probability analysis in connection 
with characteristic functions of spatially homogeneous diffusion processes and in- 
finitely divisible distributions (see e.g. Ref. 36, p. 654 and Ref. 26, p. 541). The reason 
these functions also appear in this context is roughly the following: the doubly 
hyperdominant matrices with zero excess are the logarithms of doubly stochastic 
matrices. Thus by considering Markov processes defined on (—00,4-00) rather than 
finite Markov chains one naturally obtains the continuous versions of the doubly 
hyperdominant matrices. The transfer functions of the convolution operators 
obtained here are thus the characteristic functions of the probability density functions 
of these diffusion processes. 
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with V 2 (r) a monotone nondecreasing function of r (any function of 
r which is a bounded variation over compact sets) such that 



f 

Jx 



dV 2 (r) 



and 



f- x dV,(r) 

Loo T 2 



exist for x > 0, and g{r) is any bounded real-valued function of r 
which is continuous at the origin and with g(Q) = 1 (it can be shown 
that under these conditions M z (jai) is well defined). It is then possible 
to show — ■ using an argument which is completely analogous to the one 
used above — that the integral 



s: 



M(ja>)X(-ja))f(ja>) dco 



exists and is nonnegative for any M(jo)) = M^jco) + M 2 (jco) with 
M 1 (jfco) and M z (jco) of the form given above. 

The following simple functions of co belong to this class (for the 
monotone case) and are of particular interest (Ref. 26, p. 541): 

1. The function 

M(jco) = 1 - y exp (-\co\ T ), 

where y and t are real numbers satisfying < y < 1 and < r < 2. 

2. The function 

M(j(o) = 1 — e{co\ 

where e(a>) is any real-valued, nonnegative even function of a> which 
is convex for o> > and with e(0) < 1 . 

3. The functions 



co > 0, 



M(j<o) = |co|Tl -f-7<5tan — 


for 


M(—ja>) — M(jco) for 


co < 0. 


4. The function 





M(ja>) = I co I 



1+jd 



in HI 



M(—jco) = M(jco) for 



for co > 0, 



0. 



where r, d, and co are real numbers satisfying 0<t<2, t<1, 
|<5| < 1, and <o > 0. 
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In the remainder of this section these results are tied in with the 
frequency-power formulas. Frequency-power formulas are relations 
between the power inputs and the power outputs of a nonlinear device 
at various frequencies of the almost periodic input. The discussion will 
be mainly concerned with nonlinear resistors. Frequency-power 
relations of the type given here have found application in the design of 
frequency converters and express fundamental limitations of such 
devices. 14 The proofs of the relations which follow will not be given 
since they are completely analogous to the parallel relations obtained 
above for L 2 -functions. 

Let x be an almost periodic function of t, and let /be a continuous 
map from R into itself. It follows then from the smoothness conditions 
on/that jy(0 =f(x(t)) is also almost periodic. 

Definitions: Let co k be a basic frequency common to both x(t) and y(t) 
and let x k and y k be the corresponding Fourier coefficients. Let co k > 0. 
Then the complex power R k9 the active power P k> and the reactive 
power Q k absorbed by the nonlinearity / at frequency co k are defined as 



** A ix k y k 



P k A Re R k 



Q k AlmR k . 



Frequency-power formulas are relations between the active and reactive 
powers absorbed by the nonlinear resistor at the different frequencies. 

The first relation which can then be obtained is the analogue of 
Theorem 3.4 and states that if x and x are almost periodic functions of t, 
then 

2 co k Q k = 0, 

i.e., the weighted sum of the reactive power absorbed at various 
frequencies is zero. This formula is known as the Manley-Rowe 
frequency-power formula. 

If /is a first and third quadrant function then, in analogy with 
Theorem 3.3, 

I P* > o. 

If/ is in addition monotone nondecreasing (/is then usually referred 
to as a nonlinear resistor), then the following general frequency-power 

14 For a complete treatment of frequency-power formulas see the book by Penfield 
(Ref, 27). The study of these formulas was initiated by Manley and Rowe (Ref. 28), 
and the work of Carroll (Ref. 29) and Black (Ref. 7) is particularly appropriate in 
relation to the results presented here. 
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relation can be obtained in a straightforward fashion : 
Re 2 **M(JO > 0. 

The particular choices of M given in the preceding list lead to the 
following simple frequency-power formulas : 

1. The formula 

2[l-yexp(-Kr)]P,>0, 

where y and t are real numbers satisfying < y < 1 and < t < 2. 

2. The formula 

2 (1 - e(a> k ))P k > 0, 

where e(co) is any real- valued, nonnegative, even function of co 
which is convex for co > and with e(0) < 1 . 

3. The formulas 



IKrta + e^tan^) >0 



and 



2 coJp k + Q k 6ln^)>0, 

Wfc^O \ O) / 



where r, d and co are real numbers satisfying 0<t<2, t^ 1, 
|d| < 1, anda> > 0. 

Remark I; For nonlinear capacitors with voltage versus charge 
characteristic v ^.f{q) where /satisfies the same assumptions as above, 
analogous frequency-power formulas can be obtained with R k replaced 
hyjR k la) k . The same is true for nonlinear inductors with current versus 
flux characteristic i =/(<£) with R k replaced by R k jjo> k . 

Remark 2; An important refinement of the frequency-power relations 
can be obtained if the input x(t) to the nonlinearity / is assumed to 
be quasi-periodic, i.e., if it is assumed that the basic frequencies co k 
consist of linear combinations of N fundamental frequencies. Thus, let 
co l9 co 2 , . . . , co v be N given real nonnegative numbers, let n u n 2 , . . . , n N 
be integers, and assume that the basic frequencies in x are of the form 



co, 



n 1 n 2 ...» iV r 



= 2/7=i n i 0} i- ft can tnen b e shown that the basic frequencies 
of j are of the same type and that the Fourier coefficient in the output 
y at frequency a> n n w depends on the nonlinearity /, the set of 
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integers {n % ) for i = 1, 2, . . . , N 9 and the set of Fourier coefficients in 
x -> \*n x n % „.n }> n u n 2 , * • • > "a- G A Dut * s independent of a> ls ct> a , . . . , ca^. 15 
Moreover, since 

it immediately follows that each factor of the different w/s in the above 
equality must be zero. Hence, 



2L '*litinx'»2*..«.flr 
nt>0 







2 **G. 



« t >0 



«1«2— "if 



= 0. 



In a similar manner one can then argue that with M as before, 
Re 2 i? Bl „,..„. v M(;(«ifi + «,*, + . . . + n^)) > 

for any real numbers f l5 £ 2 , . . . , l A r- 

3.7 Factorization of Operators 

In many problems in system theory, e.g., in stability theory, in 
optimal control theory, and in prediction theory, there is particular 
interest in causal operators. For instance, in network synthesis it is 
clear that a synthesis procedure for passive nonlinear networks will 
require the operator defining the input-output relation to be both 
positive and causal. The importance to stability theory of generating 
positive operators that are also causal will become more apparent later. 
In this section some techniques for generating a causal positive operator 
from an arbitrary positive operator are developed. The basic idea is 
simple and is expressed in the next theorem. The definition of a causal 
operator has been given in Section 2.4. 



Theorem 3.17. 

Let Fbe an (incrementally) positive operator on H^with W& Hilbert 
space of functions defined on the time-interval of definition S and satis- 
fying the usual properties with respect to truncations. Assume that F 

15 These facts are obvious when the function/is a power Jaw or a linear combina- 
tion of power laws. The extension to arbitrary functions / then follows readily by 
approximation. 



FACTORIZATION OF OPERATORS 73 

admits a factorization F ~ F ± F 2 on W with F 2 causal on W and F x an 
invertible bounded linear operator on W with (F*)" 1 causal on W. 
Then ^(F*)" 1 is (incrementally) positive and causal on W. 

Proof: Let x e W. Then 

<*, WiT 1 *) = <F 1 *(Ffr x ^F a (F 1 *)- 1 x> 

= mr^Fj^F*)-^) > o. 

Hence F^F*)" 1 is positive on W. Incremental positivity is proved in 
a similar way, and causality of F^F*)' 1 follows since F 2 and (F*)- 1 
are causal by assumption. 

This theorem and the resulting possibility of generating a causal 
positive operator from a noncausal positive operator show the impor- 
tance of obtaining sufficient conditions for a factorization as required in 
the theorem to be possible. Similar problems have received a great deal 
of attention in classical prediction theory, in the theory of linear integral 
equations, and in probability theory. 16 The existing results deal almost 
exclusively with linear time-invariant convolution- type operators in 
Hilbert spaces, and most of the analysis uses the fact that these operators 
are commutative in an essential way. The operators considered here, 
however, need not have this property. The factorization theorem 
obtained in this section is felt to be of great interest in its own right. It 
applies in particular to linear operators whose kernel might be time- 
varying and which need therefore not be commutative. 

All the nonlinear positive operators obtained in this chapter are 
compositions of a linear operator and a memoryless (nonlinear) 
operator. The problem of generating a causal positive operator is thus 
by virtue of Theorem 3.17 reduced to the factorization of a linear 
operator. The discussion will therefore be restricted to linear operators. 

The factorization problem is one of considerable interest and 
importance, and the natural setting for its study appears to be a Banach 
algebra. Thus, assume that the operators under consideration form a 
Banach algebra. As is easily verified, the causal operators will then 
form a subalgebra since causal operators are closed under addition, 
under composition, and under multiplication by scalars. This is the 
reason for introducing the projection operators and stating the theorem 
in terms of arbitrary projections and elements of a Banach Algebra. 

The general factorization theorem thus obtained is then specialized 
to certain classes of linear operators in Hilbert space. It is also shown 

16 These problems permeate the work of Wiener (Ref. 30) and Krein (Ref. 31). 



74 POSITIVE OPERATORS 

that in the case of certain convolution operators with a time-invariant 
kernel the results are rather conservative, and less restrictive factoriza- 
tion theorems due to Krein (Ref. 31) exist. The setting of the factoriza- 
tion problem is the same as used by Zames and Falb (Ref. 14), but 
the results are more general. The method of proof is inspired by a paper 
by Baxter (Ref. 32) in probability theory. 

Definitions: The relevant notions of Banach algebras have been 
introduced in Section 2.5. Let a be a Banach algebra with a unit /. A 
bounded linear transformation, 7r, from a into itself is said to be a 
projection on a if tt 2 = ir and if the range of tt forms a subalgebra of cr. 
Note that the range of a projection is thus assumed to be closed under 
addition and multiplication. The norm ofrr, denoted by ||7r||, is defined 
in the usual way as the greatest lower bound of all numbers M which 
satisfy for all Aea the inequality \\irA\\ <M|U||. 17 The identity 
transformation on a is denoted by 0. 

The following factorization theorem states the main result of this 
section. 18 



Theorem 3.18 

Let a be a linear Banach algebra with a unit element / and let rr+ and 
rr~ = — 7r + be projections on cr. Let <r+ and <r~ be the ranges of tt + 
and ir- 9 and assume that ||7r + || < 1 and that ||*r|| < 1. Let Z be Jan 
element of cr and let p be a scalar. If\\Z\\ < |/>|, then there exist elements 
Z + e a and Z~ e a such that: 

1. M= P I-Z~ZZ+; 

2. Z + and Z~ are invertible in a; and 

3. Z + and (Z+y 1 belong to cr+ © J, 19 and Z" and (Z')' 1 belong to 

or"©/. 

In order to prove Theorem 3.18, Lemmas 3.8-3.10 will be established 
first. 



17 Contrary to Hilbert spaces, projections in Banach spaces need not have norm 
less than or equal to one. Since the spaces under consideration here are spaces of 
bounded linear operators, there is in general no inner product structure on them, and 
the condition ||7r + || < 1 thus becomes a constraint. 

18 A similar theorem, due to Masani, with the estimate \\Z\\ < p/4 has appeared 
in the literature (Ref. 33). Another interesting factorization theorem in a somewhat 
different setting has been obtained by Gohberg and Krein (Ref. 35). 

19 The notation a + © /means all elements of or which are of the form R + al with 
R 6 a + and a a scalar; a~ © / is similarly denned. 



FACTORIZATION OF OPERATORS 75 

Lemma 3.8 

Let {A k } 9 {P k }, and {N k }, k — 1, 2, ... be sequences of elements of 
o% cr**, and a~ respectively and assume that for some r > and all 
|r| < r : 1, The series A = I+ 2^i A *r\ P = / + 2^i^ 7 % and 
JV=/+ 2r=i^fc r * converge; and 2, the elements are related by 
A = PN. Then A uniquely determines the sequences {PJ and {JVJ. 

Proof: Equating coefficients of equal powers in r in the equality 
A = iWleadstoPi + N x = ^andP n + N n = A n ^^lZ\P k N n _ k ior 
n = 2, 3, . . . . Thus P n = 7r+(A n - j£Z\P*N*-J and N n = rr~(A n - 
2fc=i i > A: iV 7 ,_ fc ) which shows that A uniquely determines P n and N n 
provided it uniquely determines P l9 . . . , P Vr _ 1 and N l9 . . . , JV B _!. Since 
A uniquely determines P x and N x by P x — -tt + A 1 and N x = irA u the 
result follows by induction. 

Lemma 3.9 

The equations P — I + rn*(ZP) and N=I+ rv-{NZ) have a 
unique solution Pea and N e a for all \r\ < |p| _1 . Moreover, these 
solutions are given by the convergent series P = ^ k=1 P k r 7c and JV = 
2^ JV* with P = N - /, P k+1 = <rr+{ZP k \ and N k+1 = ^(N^). 
Also, P e cr 4- © / and Neo- ® I. 

Proof: The result follows from the inequalities 

\\nT+(Z(A-B))\\<\ P \-i\\Z\\ M-JB||, 

HrTr-a^-^zXKH-MlzilM-aii, 

and the contraction mapping principle. Moreover, it is easily verified 
that the successive approximations induced by this contraction mapping 
with P «= N = I yield the power series expressions of P and N as 
claimed in the lemma. 

Lemma 3.10 

The solutions P and N to the equations of Lemma 3.9 are invertible 
for all \r\ < |p|~\ the inverses being given by P~ x = / — r7r + (iVZ) and 
N" 1 = / - r7r-(ZP). Moreover, A^P" 1 = I - rZ for all |r| < IH" 1 . 
Also, P- 1 e C7+ © / and N~ x e 0- © /. ■ 

Proo/; From the equations defining P and JV it follows for \r\ < Ip] -1 
that 



\m + (NZ)\\ < 



M ||Z|| 



1 - \r\ IIZII 



and ||r7r"~(ZP)|| < 



\r\ m 



1 ~ \r\ ||Z|| 
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Since all elements of a which are of the form I — B with ||2?|| < 1 are 
invertible, it thus follows that / - rir+{NZ), / — rir~{ZP) and / - rZ 
are invertible for \r\ < |/>| -1 /2. Furthermore, the inverses are given by 
the convergent series 

(/ - rit\NZ)T x = / + f (w + (JVZ)) fc r*, 



fc=i 



(J - rZ)' 



.-1 






From the equations of P and JV, it follows that (/ — rZ)P = I — 
nr~(ZP) and N(I - rZ) = / - rir+(NZ) for |r | < |p|-* and thus that 
(/ - rZ)- 1 = P(/ ~ rTr-(ZP))" 1 = (J - rir+iNZ^N for \r\ < Ipi" 1 ^. 
Since all factors in the equalities are given by the convergent series 
given in this lemma and in Lemma 3.9 and since <r + and a~ are closed 
under multiplication, Lemma 3.8 is applicable. This yields P = 
(/ - nr+(NZ))~\ N=(I~ nr-{ZP))^ and PN = (/ - rZ)~* for 
\r\ < ipl' 1 / 2 - Thus for \r\ < |p| -1 /2 the following equalities hold: 

P(I - r7r+(NZ)) = (I - r<rr+(NZ))P = /, 
N(I - r7f(ZP)) = (/ - rir'{ZP))N = 7, 
(/ - r7r-(ZP))(/ - rir+(NZ)) = / - rZ. 

Since, for |r| < |/>|~\ all terms in the above equalities are given by 
geometrically convergent power series in r, they are analytic functions 
of r for \r\ < Ipl" 1 . Since equality holds for \r\ < |/)| _1 /2 it is thus 
concluded from analyticity that equality holds for all \r\ < |p| _1 . 

Proof of Theorem 3.18 

Let r = p~ x in the Lemma 3.10. The theorem follows with Z~ = 
p(/ _ p -hr-(ZP)) t (Z-)- 1 = p-W, Z+ = / - p~ht+(NZ) and (Z+)- 1 = 
P. 



Under a suitable choice of the Banach algebra and the projection 
operators a number of interesting corollaries to Theorem 3.18 hold, 
two of which will now be stated. 

Let R be an element of J$?(/ 2 ,/ 3 ) and let {r kl } 9 k, IeI be the corre- 
sponding array. R is said to belong to ^ + (l^,I^) if r kl — for all k < /. 
It is said to belong to jS?-(/ a ,/ a ) if i?* belongs to -S?+(/ a ,/a). Elements of 



FACTORIZATION OF OPERATORS 77 

J5f + correspond to causal elements of jSf whereas elements of J2P- 
correspond to anticausal elements of ££ . 



Corollary 3.18.1 

Let Z be an element of J§? (/ 2 ,/ 2 ) such that Z — e/is doubly dominant 
for some e > 0. Then there exist elements M and N of J2?(/ a ,/ 2 ) such 
that: 

1. Z = AW; 

2. Af and N have bounded inverses Af _1 and N^ 1 ; and 

3. iV* and JV' 1 belong to J§?+(/ 2 ,/ 2 ), and M and M" 1 belong to ^"(/ 2 ,/ 2 ). 20 

Corollary 3.18.2 

Let ^4(z) — e be the r-transform of a sequence which is dominant for 
some € > 0. Then there exist functions A+(z) and A~(z) such that 

1, A(z) = A~(z)A+(z); and 

2. ^4+(z) and U+O))- 1 are the z-transforms of /^sequences {a+} and 
{b+} with < = 6+ = for fc < 0, and ^~(z) and (A'tyy 1 are the 
z-transforms of / x -sequences {a~} and {&~} with a~ = b~ = for 
A:>0. 

Paw/; Both of these corollaries follow from Theorem 3.18 under a 
suitable choice of the Banach algebra a and the projections 7r+ and ir~. 
Corollary 3.18.1 follows from Theorem 3.18 with the Banach 
algebra a all members of jSf (/ a ,/ 2 ) such that if A e a and if {a kl } 9 kjsl 
is the corresponding array, then the sequences {a kl } belong to / x for 
fixed k and /, uniformly in k and /; i.e., there exists an M such that 
ST~«> \a kl \ < M and 2i=-oo \a kl \ < M. Multiplication is defined in 
the usual way as composition of elements of J?(I Z J 2 ), The norm is 

20 It comes somewhat as a surprise that Theorem 3.18 when applied to linear 
operators on & % yields factorization of doubly infinite matrices, just sufficient to 
ensure factorizability of the positive operators discovered earlier in this chapter. 
This result also hinges on the somewhat unexpected type of norm chosen in applying 
Theorem 3.18. Notice that this norm is not the one induced by the / 2 -topology. The 
reason for not using the induced norm is that it was not possible to show that 
||tt + || < 1 in this topology. Whether or not this is true remains unclear. For time 
invariant operators this question reduces to proving or disproving that for any 
g^ Li, 



max 



If 



g{t)e~3(atdt 



> max 



If 

I Jo 



g{t)e~3<*>t dt 



If this inequality holds then Theorem 3.18 can be used to show that any strictly 
positive operator on a Hilbert space admits a factorization into a causal and an 
anticausal part. 
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defined as the greatest lower bound of all numbers M satisfying the 
above inequalities. The nonobvious elements in verifying that a forms 
a Banach algebra are that a is closed under multiplication, that 
\\A3\\ < Ml ||£|| for all A, Bee, and that a is complete. Closedness 
under multiplication follows from Fubini's Theorem for sequences 
(Ref. 22, p. 245) and the inequalities 



+ 00 

2 

k=-oo 



+ 00 



and 



+ 00 

2 



+ 00 

2 a kAi 



+ 0O +00 

Je=~ coi=— oo 
+oo +oo 

= 2 \b a \ 2 Kl 

i=— oo ft=— oo 

< M|| ||£|| 



Mil 11*11- 



These inequalities also show that \\AB\\ < \\A\\ \\B\\. Completeness 
follows from the fact that l x is complete. The projection operator tt+ 
is defined by ir+A = B; if {a kl } and {b kl }, kjel are the corresponding 
arrays, then a kl = b kl for all k > /, and £ w = otherwise. The operator 
ir- is defined by tt~ — — tt+. It is clear that ||tt + || = 1 and that 
|| ff -|| == 1. The only fact that is left to be shown is that if Z — el is 
doubly dominant for some e > 0, then Z can be written as Z = 
pi— -4 with Mil < P- It is easily verified that any p with |/>| > 
sup**/ z fcfc yields such a decomposition. 

The proof of Corollary 3.18.2 is completely along the lines of the 
proof of Corollary 3.18.1 with the Banach algebra a consisting of all I x 
sequences, and with multiplication of A = {a k } and B = {b k } defined 
by AB = C = {cj, where c k = 2+-ao %-A and M|| = 5*— « 1^1- 
The projection operator tt+ is defined by tt+A = 5 with A = {#*.}, 
jB = {& fc }, 6 & = a A for k > 0, and ^ = for fc < 0. The operator ir- 
is defined by tt~~ = 6 — tt + . 

The factorization in Corollary 3.18.2 is valid under much weaker 
conditions than stated. Indeed, although dominance of the involved 
sequence is certainly sufficient for the factorization to be possible, it is 
by no means necessary — as is shown by the following theorem, due to 
Krein (Ref. 31). 
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Theorem 3.19 

Let A{z) be the z-transform of an / x -sequence. Then there exist 
functions A+(z) and A^(z) such that: 



A(z) = A-(z)A+(z), and 

A+(z) and (A+^z))- 1 are the ^-transforms of /i-sequences {a*} and 

{b+} with *+ = b+ = for k < 0, and A~(z) and {A~{z))-^ are the 

z-transforms of / r sequences {a k } and {b k } with a k = b k = for 

k>0 3 

if and only if A(z) ^ for \z\ = 1 and the increase in the argument 
of the function A(z) as z moves around the circle \z\ = 1 is zero. 
Moreover, all factorizations which satisfy conditions 1 and 2 differ 
only by a nonzero multiplicative constant. 



The factorization analogous to those obtained in Corollaries 3.18.1 
and 3.18.2 for operators on L 2 with time- varying kernels is straight- 
forward and will not be explicitly given. The analogue to Theorem 3.19 
for operators with a time-invariant kernel is Theorem 3.20; Theorem 
3.20, which is also due to Krein, is less restrictive than the analogous 
factorization obtained in Theorem 3.18. 

Let 9 X be a class of operators from L % into itself each element of 
which is determined by an element (g(t),{g k }) of L x x l x and by a 
mapping {t k } from I into R. The operator Ge^ maps x e £ a into y, 
With y(t) = 2£U g k x(t - t k ) + JiS g(t - t)x(t) dr. It is simple to 
verify that G is well defined, i.e., that it maps I, a into itself. Let 3Pf 
denote the subclass of @ x for which the determining element of L x x l x 
and the mapping {t k } satisfy g(t) = for / < and t k > for all k e I. 
Let ^r denote the subclass of ^ x for which the determining element of 
L x x l x and the mapping {t k } satisfy instead g-(f) = for t > and 
* fc < for all keL Clearly (7 e &t if and only if G* e #r. 

Theorem 3.20 

Let G e @ x and assume that all the delays are equally spaced, i.e., 
that t k = kT for some k. Then there exist elements G+ e <$ x and G~ e 9 X 
such that: 



and (G-)- 1 e <§\ 



: 1. 


G = G-G+, 


I 2 - 


G + and G~ are invertible, and 


i 3. 


G+ and (G+)" 1 e ^f, and G~ a 
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if and only if, \G(jco)\ > €, for some e > and all m e R, and the 
increase in the argument of the function G(jco) as co varies from — oo 
to + oo is zero. 21 



A slightly weaker version of Theorem 3.20 is given by Krein (Ref. 31). 
However, the extension of his proof to cover the form given in Theorem 
3.20 presents no apparent difficulties. 

The main interest in the above factorization of operators stems from 
the fact that it leads to the solution of the so-called Wiener-Hopf 
equation. The Wiener-Hopf equation plays a central role in applied 
mathematics (Ref. 34), and its importance in the analytical design of 
engineering systems cannot be overestimated. It is the logical conclusion 
of the optimization of quadratic functional subject to linear constraints. 
In fact, the celebrated Riccati equation — which plays a central role in 
control and detection theory — can be viewed as a method of solving a 
Wiener-Hopf equation. Among its applications are the optimal control 
of linear systems under quadratic performance criteria, minimum 
variance filtering of Gaussian processes, network synthesis, solution of 
boundary- value problems, etc. 

In the abstract notation of this section, application of the Wiener- 
Hopf equation amounts to asking for the solution X of the system of 
equations 

ir+(XR) = 7T+M and tt+X = X. 

Theorem 3.21 below shows how this solution can be readily obtained, 
provided that R admits a suitable factorization. First, however, a simple 
lemma will be proven : 

Lemma 3.11 

Let c be a linear algebra with a unit element J and let 7r + and tt~ = 
Q _ -^ be projections on a with ranges cr + and g~ respectively. Let 
A e a and B e a~ © /. Then tt+{(tt+A)B) = rr+{AB). 

21 Let G(ja>) 9 co G R, be a complex-valued function of a> with G(ja>) = A(j(o) -f 
L(jco) and A(jco) periodic and lim w ->±oo L(joS) = 0. The increase in the argument 
of the function G(ja>) is said to be zero if 

lim dsLTgGijco) « lim —. dsiVgA(j(o) = 0. 

This definition is a natural generalization of the usual definition of the increase in the 
argument of a complex-valued function. 
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Proof: Since 

7T+{AB) = 77+((7T+^ + W-A)B) 

= 7r+((7r+A)B) + n*((irA)B) = it+(kn*A)B), 
the lemma follows. 

Theorem 3.21 

Let a be a linear algebra with a unit element / and let ?r + and rr~ — 
— 7T+ be projections on a with ranges o+ and <r~ 9 respectively. Let 
R and M be given elements of <r, and assume that there exist elements 
R+ecr and Rr ea such that: 

1. R = R+R~; 

2. 7? + and R~ are invertible in a; and 

3. R+ and (R+)- 1 belong to o+ © /, and Rr and (R-)- 1 belong to <r* © /. 

Then the unique solution of the Wiener-Hopf equation, i.e., the 
solution of the system of equations tt+(XR) = tt+M, tt+X = X, is given 
in terms of this factorization of R by X = (^^(iO" 1 ))^*)" 1 . 

Proof: It will first show that X = (7r+(M(JR-)- 1 ))( J R+)- 1 is a solution of 
the Wiener-Hopf equation. Clearly rr+X = X. Moreover, 

7r+(XR) = Tr+iiir+iMiR-y^iR+yiR) 

= TT+CCTr+C^Ci?-)- 1 ))^) = ^((MlR-y^R-) = 7T+M. 

These equalities follow from Lemma 3.11 since (R + )~ x e or+ © / and 
iT e o*- © /. To show that (7r+(A/(i?-)- 1 ))(^)- 1 is the unique solution 
of the Wiener-Hopf equation, assume that tt+(XR) = tt+M and 
7r+Z = X. Then (7^(7iH-(JW^jR-)))(jr)-i = ^(M^-)- 1 ) since OR-)" 1 e 
<r~ © /. Hence, since JR+ e cr+ © /, the equalities ^(MOR-)- 1 ) = 
tt+(XR+) = A7J+ hold, which shows that (^(MCi?-)- 1 ))^)" 1 is 
indeed the unique solution of the Wiener-Hopf equation. 

3.8 Positive Operators on Extended Spaces 

As defined in Section 2.5, a causal operator F from L 2e into itself 
is said to be (incrementally) positive on L 2e if for all Te S, P T FP T is 
(incrementally) positive on L 2 . As was pointed out there, the following 
relation between positivity on L % and L 2e exists: a causal operator on 
L 2e which in addition maps L 2 into itself is (incrementally) positive on 
Lz e if and only if it is (incrementally) positive on L a . Hence nothing 
new is required to examine the positivity of operators on L 2e provided 
they map L 2 into itself. 
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For causal operators on L 2e which do not map L 2 into itself it is, 
however, a simple matter to generalize the theorems of the previous 
situation to operators on L 2e . 22 For example, Theorem 3.2 can be 
extended to cover the case where K(t) e L^ e (it suffices for K(t) to 
belong to L^ e in order for the operator (Kx)(t) = K(t)x(t) to be 
well defined as a map from L 2e into itself). The analogue of Theorem 
3.11 is also obvious and requires the analogue of the dominance 
conditions without the requirement that $\ip keI m kk < oo; instead, it is 
assumed that m kl = for k < /. This analogue, however, does not 
fully exploit the power of the rearrangement inequalities derived in 
this chapter, and it is in fact possible to prove a somewhat stronger 
positive-operator theorem, which does not require bound'edness of the 
operators involved. 

Definitions: Let M = {m kl }, with k, /e/+, be a square array of real 
numbers. Then M is said to be doubly hyperdominant if m kl < for 
k y£ I and if 2£L m kl and 2£= m n exist and are nonnegative for all / 
and k. M is said to be doubly dominant if m u > Xfc^o.a^i \ m ^ anc * 
m kk > 2^=0,^ |/w fc i|. Every doubly dominant array M, with k, /e /+ 
and m kl = for k < /, defines a causal linear operator from / 2e (0,co) 
into itself with (Mx) k = 2f=o m ki x i- This operator need not, however, 
be bounded on l 2e . Given two square arrays of real numbers M x = 
{mg 1 } and M % = {mj^}, &, /e/ + , it is possible to define the product 
M X M 2 as the square array with (M x M 2 ) kl A 2^o m lV m a } whenever this 
infinite sum exists for all & and /, and thus in particular whenever M x 
is causal (i.e., whenever M x defines a causal operator on l 2e ; in other 
words, whenever mfy — for k < /), since the summation defining 
this product then becomes finite. Let M — {m kl }, k, IeI+, be given; 
then M* = {n kl } 9 k, lel+ is defined as the square array with n kl = m ifc 
for all fc and /. 

Theorem 3.22 

Let L x and L % be causal linear operators from / 2e (0,oo) into itself and 
let F denote the operator on ZaeC * 00 ) defined by (Fx) k = f(x k ) with/ 
a map from R into itself. Assume that L x has a causal inverse on / 2<J (0, oo). 

22 It should be pointed out that Theorem 3,1 is an exception to this, i.e., that, in a 
sense, the bounded positive time-invariant causal convolution operators on L 2e 
define in fact the most general positive time-invariant causal operators on L Ze as 
well. More specifically, consider the convolution operator (as in Theorem 3.1) with 
vanishing kernel for t < (for causality) and whose kernel is locally integrable (i.e., 
integrable on compact sets) so that it defines a causal linear time-invariant operator 
on L ie . If for some AT" > 0, the integral of the norm of the kernel on the interval 
[T, T + AT] becomes unbounded as T-> oo, then this operator will not be positive 
on L 2e . 
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Then L X FL 2 is a positive operator on / 2e (0,oo) for any monotone 
nondecreasing (odd-monotone nondecreasing) first and third quadrant 
function /if and only if the square array determined by Z, a (£~ 1 )* is 
doubly hyperdominant (dominant). 

Proof: Let N be a nonnegative integer, let x e/ 2e (0,a>), and let F N x 
denote the truncation of x. The operator P N L X FL % P N is positive on R N 
if and only ifP N I^P N L l FL^ N (Lp)*P N is. Since, however, 

P N L-ip N L x FL 2 P A iL-TPx = P N FL 2 (L^rP N9 
the theorem follows from Theorem 3.10. 
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4 Feedback Systems 

4.1 Introduction 

Feedback remains the basic concept of control: some output variables 
of the system to be controlled are measured, and this information is 
processed to generate an input to. the system to be controlled such 
that the overall system behaves in some desired fashion. Feedback 
control thus involves cybernetics: observation, information processing, 
decision, and execution. It is believed that Norbert Wiener first had the 
insight to conclude that such cybernetic situations are intimately 
related to uncertainty: in the absence of uncertainty, there would be 
no need for feedback, and all decisions could be made "open loop." 
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However, no system (plant, sensor, or actuator) is free of uncertainty, 
and control engineers always resort, in the end, to feedback controllers. 
In this chapter the basic properties of feedback systems are studied: 
these include their well-posedness, stability or instability, and con- 
tinuity or discontinuity. Also, sensitivity is briefly mentioned. The 
feedback system to be studied is shown in Figure 4.1. 



4.2 Mathematical Framework 

A model applicable to most feedback systems is shown in Figure 
4.2. 1 The functional equations describing this feedback system are 

*?i = Wi - y 29 

e 2 = w a + y l9 

y t = G x e u and 

y z = G z e 2 , 



(FE) 



where w l9 w a are called the inputs; e u e 2 are called the errors, and 
y u y z are called the outputs. 

Let 7^ be a (finite) real number and let S, the time-interval of definition, 
be [r ,oo). 2 Let B ± and B 2 be given Banach spaces, and let 7(J5^), 



^® 



®- s 



Figure 4.2 The Feedback System under Consideration 



1 The assumptions implied by this configuration are: 1, additivity of the inputs 
and 2, only two input summing junctions are present in the loop. The methods 
presented here lend themselves easily to the study of more general configurations. 
The philosophy of using this configuration is that one assumes additive driving 
noise and additive noise in the measurements, the usual situation studied, for 
example, in combined estimation and control problems. 

2 The extension to discrete systems is a trivial one. The case 5 = (— oo,-fco) is 
not treated since the author is not convinced that this case is of physical significance. 
Some remarks regarding this case will be made in Section 4.6. 
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i = 1 , 2, denote the linear space of ^-valued functions defined on S, 
i.e., Y(3J&{x\x:S^BJ. 

Definition: Let TeS. Then P T denotes the projection' operator on 
Y(B^ i = l,2, defined for x e Y(BJ by 



(P T x)(t) A 



x(f) for t < T, t e S 
otherwise 



P T will be called the truncation operator and i^x will be called the 
truncation of x at time T. The collection {P T } 9 TeS, thus consists of a 
family of projection operators on Y{B^). 

Definition: Let M^c Y{B^), i — 1, 2, be a Banach space. The extended 
space, W ie , is defined as 

W* A{xe y(^) | P T x g PT. for all TeS,T finite}. 



Recall the definitions of a causal, strongly causal, and locally 
Lipschitz-continuous operator: Let F be a mapping from W ie , i = 1, 2, 
into itself. Then i? is said to be causal on W ie ifP T FP T = PyF on W ie . 
It is said to be strongly causal on W^ 6 if Fis causal on pr i6 and if for all 
TeS, e > 0, and T' e S, T < T, there exists a real number Ar > 
such that \P T >^ T (Fx - FjOfl < « II^WtC* - j)ll f° r any x, 7 e JF,, 
with P T >x = iVj>. It is said to be /octf/Zy Lipschitz continuous on H^ e 
if for all TeS, P T FP T is Lipschitz continuous on W v These notions 
are generalized in an obvious way to operators from W le to W Ze , 
W le x W %e to W u , etc. , and will thus be used freely in the latter context 
as well. 

For various technical reasons a number of additional concepts 
involving causal operators are needed. 

Definitions: Let F be a strongly causal operator from W ie , i — \,2, 
into itself. Then F is said to be strongly causal, uniformly with respect 
to past inputs, if for all T e S, e > 0, and T eS,T' < T, there exist 
real numbers AT > and K < 00 such that \\(P T > +AT - P^iFx - 
Fy)\\ < K \\P T >(x - 7)i| + 6 || (iWr - iV)(* - jOII for all x,j> e W ie . 
Locally Lipschitz continuous strongly causal linear operators are in 
fact strongly causal, uniformly with respect to past inputs. 

Assumptions on the spaces. The following assumptions are made on 
the spaces W t , i = 1,2: 
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W.l. The spaces Wi are closed under the family of projections {P T } 9 

TeS. 
W.2. For any x e W i9 the norm ||P r Jc|| is a monotone nondecreasing 

function of T which satisfies lim Tm ||iV*ll ^ 0andlim rtoo \\P T x\\ = 

||*|| . The family of projection operators {P T }, Te S, is thus assumed 

to be a resolution of the identity. 
W.3. If x e W i6 , then x e W t if and only if sup TeS \P?x\ = lim^^ 

\\P T x\\ < °°- 
WA For any real numbers t l9 t % such that T < t t < t % , the vector 
space W i{tiM) J±{xeWi\ x(t) — for t$ [t x ,t 2 ]} is a closed subspace 
of Wi and is thus itself a Banach space under the norm of W^ 

Assumptions on the Operators. The following assumptions are made on 
the operators G i9 i = 1, 2: 

G.L The operator G x maps W le into W 2e , and G 2 maps W 2e into W u . 
G.2. The operators G^ are causal on W i9 i.e., P T G { P T ~ P T G 4 on H^ 

for all TeS. 
G.3. The operators G^ are locally Lipschitz continuous on W ie , i.e., 

sup IIW^-^)I] <00 foraI1 reS . 

as^lSF, ||-Py(x — y) || 

G.4. The operators G^ satisfy Gfi = 0. 3 



Assumption on the Inputs. The following assumption is made on the 
inputs u iy i — 1, 2: 

1.1. The input */,. belongs to JT ie . 

The important notions regarding the feedback system under con- 
sideration are those of solutions, well-posedness, stability, and con- 
tinuity. Definitions of these terms are now introduced. 

Definition of a solution. Let u t e W ie , i = 1, 2, be given. Then the 
quadruple {e x ,e % ,y x ,y 2 } is said to be a solution of the feedback equations 
if: 

50.1. The inputs and outputs e l9 y 2 belong to W u , and e 2 , y x belong 
to W 2e . 

50.2. The feedback equations (FE) are satisfied by {e x ,e z ,y x ,y 2 }. 

3 Assumption G.4 can always be taken to be satisfied by proper redefinition of the 
operators. 
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The above definition of a solution is a natural one. The definition 
of well-posedness is somewhat more delicate. 

Definition of well-posedness. The feedback system described by the 
equations (FE) is said to be well posed if ': 

WP.l. There exists a unique solution for any pair of inputs u t e W ie , 
r= 1, 2. Let G denote the operator from W le x W 2e into itself 
defined for x = (x l9 x % ) e W l6 X W %e by Gx A (<? 2 x 2 , — C?iJCi). Con- 
dition WP. 1 thus requires that the operator / + G be one-to-one and 
onto, i.e., invertible on W le x W u . 

WP.2. The errors and the outputs depend on the inputs in a non- 
anticipatory way; i.e., P T e l9 P T e z , P T y x , and P T y % depend, for any 
T e S, on P T u x and P T w 2 only. Conditions WP.l and WP.2 combined 
thus require the operator / + G to be invertible on W u X W 2e and 
(/ + G)" 1 to be causal on JF le X PF 2e . 

WP.3. The errors and the outputs depend, on finite intervals, Lipschitz 
continuously on the inputs. Conditions WP.l, WP.2, and WP.3 
combined thus require the operator I + G to be invertible on 
W le x W u and (I + G)" 1 to be causal and locally Lipschitz con- 
tinuous on W le X W Ze . 

WP.4. The errors and the outputs are insensitive to modeling errors in 
the following well-defined sense: Consider the functional equations 

e x = S n (u l9 Gz x eJ 9 _ 

(PFE) 
e% = T^S %x {u %9 G Xk e x ) 9 

which describes the "physical" feedback system shown in Figure 4.3. 
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Figure 4.3 The Physical Feedback System 

It is assumed that the family of operators, G iA , i= 1, 2, are 
parametrized by a parameter A which is assumed to belong to some 
index set A. The set A will be assumed to be a subset of a normed 
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linear space and contain a neighborhood of the origin. Let T , e < 0, 
denote a time delay. The functional equations (PFE) are assumed to 
approximate the feedback system described by the functional 
equations (FE) in the sense that for all U\\ sufficiently small: 

1. The operators S lx , S 2x , G lx9 and G 2x map W u x W u into W U9 
W Ze x W u into W 2e , W u into W U9 and W Ze into W le9 respectively, 
and T € , e < 0, maps W Xe into itself by 

(x(t + c) for * > T - € 

(W)A 

10 for T < t < T - e , 

i.e., r, denotes a time delay with delay — e > 0. 

2. The operators S ix and G u , i = 1, 2, are fccaffj; Lipschitz con- 
tinuous on their respective domains. 

3. The operators S iX and (?„, / = 1, 2, are strongly causal on their 
respective domains. 

4. The "physical" feedback system described by the equations 
(PFE) approximates the feedback system described by the equation 
(FE) in the following well-defined sense: For any Te S and any 
i?i, w x e W le and t? a , w 2 e W U9 the operators 5 U and G ix satisfy 
lim^o S , lA (u 1 ,w 1 ) = v x - w ls lim A ^ 5 2A (i7 2 ,w 2 ) = v % + w a , 
lim A ^ G^ = G^, and lim^ G 2A z; 2 = G 2 v 2 . 

It then follows that for any TeS and ||A|| and -e > sufficiently 
small: 

1 . For any input pair w f e JF <# , i = 1,2, there exists a unique solution 
pair ^ e PF ie , i = 1 , 2, to the equations (PFE). Let e i = F ix (u u u 2 ) 9 
i = 1 , 2, denote these solutions. 

2. The operators F/ A , / = 1, 2, are caitta/ on their domain. 

3. The operators F? x , i = 1, 2, are /ooz/Ty Lipschitz continuous on 
their domain. 

4. The operators G 1X F* X and Tfi %x F £ %x are rfiwigfy «a#w*/ on tf«r 
domain. 

It is then required for well-posedness that for any input pair 
Ut e W ie , i = l,2, the solutions i^ satisfy 

lim lim (F^( Wl ,w 2 ), FJ^u^w,)) == (/ + G)'\u l9 u t ) 

Well-posedness is essentially a modeling problem. It expresses that a 
mathematical model is, at least in principle, adequate as a description 
of a physical system. The definition which follows is that of stability, 
which is a desired property of a feedback control system. 
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Definition of stability and instability. The feedback system described 
by the equations (FE) is said to be stable if it satisfies conditions ST.l 
and ST.2: 



ST.l. It is well posed. 

ST.2. Any inputs u t e W i9 i — 1, 2, lead to errors and outputs e Xt 

y 2 e W x and e z , y x e W 2 . 

The system is said to be stable with finite gain if it is stable and if in 
addition: 

ST. 3. There exist constants p l9 p 2 < oo sucn tnat f° r an Y inputs 
u< e W u i = 1, 2, UeJI, |k 2 ||, ||^||, ||j 2 || < ft Kl| + p 2 ||« 2 ||. 

The feedback system is said to be unstable if: 

INST.l. It is well posed. 
INST.2, It is not stable. 

A somewhat stronger desired property of a feedback controller is 
that of continuity. 

Definition of continuity and discontinuity. The feedback system de- 
scribed by the equations (FE) is said to be continuous if: 

C.l. It is stable. 

C.2. Let Ui e W i9 i — 1, 2, be given (but arbitrary), and let e t and y i 
be the corresponding errors and outputs. Then there exists, for any 
given e > 0, a d > such that the inputs u i + Aw^ with At^ e W t 
and HAkxII, ||Aw 2 || < e yield corresponding errors, e i + &e i9 and 
outputs, y i + &y i3 with Ae 1; Ay 2 e W l9 Ae 2 , Aj^ e PT 2 , and ||AeJ, 
l|Ae 2 ||,||A 7l || 5 l|Aj 2 ||<5. 

The system is said to be Lipschitz continuous if it is continuous and if in 
addition : 

C.3. There exists a K < co (independent of u i9 i = 1, 2), such that 
d = Ke will satisfy condition C.2. 

The system is said to be discontinuous if: 

DC.l. It is well posed. 
DC.2. It is not continuous. 
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4,3 Well-Posedness of Feedback Systems 

The question of well-posedness of mathematical models for physical 
systems is of fundamental importance in engineering. 4 

4.3.1 Justification of the Definition of Well-Posedness 

A mathematical model is generally said to be well posed if solutions 
exist and if these solutions are unique, continuous with respect to the 
input variables, and continuous with respect to modeling errors (i.e., 
errors in the parameters of the system). Well-posedness thus imposes a 
regularity condition on feasible mathematical models for physical 
systems. Well-posedness expresses the fact that a mathematical model 
can be, at least in principle, an accurate description of a physical 
system. In other words, since exact mathematical models would always 
be well posed, one thus requires this property to be preserved in the 
modeling. 

The definition of well-posedness as given in Section 4.2 requires 
justification. The assumptions on G imply that it itself be well posed, 
i.e., that it is a well-defined map, causal, locally Lipschitz-continuous, 
and a suitable approximation of a physical system. A "physical system" 
is defined as a locally Lipschitz-continuous strongly causal system. 
This concept of "physical system" is not standard and calls for an 
explanation. To require that a physical system be a continuous, well- 
defined and causal map is natural. The strong causality condition is 
inspired by the fact that any system exhibits some delay between the 
input and the output. No system can instantaneously transmit a signal, 
hence all physical systems have at least an infinitesimal delay. The 
second point to make about the definition of well-posedness is the 
precise mathematical interpretation of the idea of a "suitable approxi- 
mation." For F to approximate F k it is required that the operator 
F — F x should be small in some appropriate sense. One possible 
choice — the first one which comes to mind, and mathematically 
the most convenient one — is to require F — F x to be small in the 
uniform operator topology, i.e., to require \\P T (F - F A )|| A to be small 
for all TeS. This would then typically take into consideration errors 
in time constants, in gains, and in initial conditions. It is, however, 
not the choice adopted here since it is merely required that F should 

4 A basic paper on the well-posedness of feedback systems is that by Zames 
(Ref. 1). The setting considered in the present monograph is quite different, however. 
Reference 1 points out somewhat more carefully why the concept of "physical 
system' ' that is adopted here is a natural one. This is done in terms of ideal elements. 
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approximate F k in the strong operator topology, i.e., that (F — F x )x 
should be small for any fixed x. The reason for adopting this choice, 
which makes the problem of well-posedness mathematically much more 
difficult, arises from engineering considerations. Indeed, examples of 
typical effects that are invariably ignored in the mathematical model are: 
small time delays, band limitation over broad bandwidths, distributed 
effects, small time constants, and similar phenomena that are believed 
to happen much faster than any changes due to the dominant dynamics 
of the system or in the expected variations in the inputs. It can easily 
be verified that none of these effects is (in general) continuous in the 
uniform operator topology. The induced norm on Lf, 1 <j9 < oo, of 
the identity minus a delay of length X > 0, is always two. The same 
holds for the identity minus the system with transfer function If{Xs +1), 
X > 0, no matter how small A, and thus no matter now closely the 
impulse response (ljX)e~ tn "resembles" a unit impulse. These effects 
are close to the identity, however, in the strong topology on Lf , 
1 < p < oo, in the sense that lim^o F x x = x for any x e Lf , 1 < p < 

00. 

A careful examination of the definition of well-posedness also reveals 
that a pare delay T e , is assumed to be present in the loop describing 
the physical feedback system. This assumption is inspired by the 
consideration that no information can travel faster than the speed of 
light, which is a finite, albeit very large, constant. It thus seems . 
very reasonable to make such an assumption. It should be noted that 
this assumption has not been introduced capriciously, but that it is 
essential to the mathematical development. 

The conditions assumed on the physical model in the treatment of 
well-posedness appear to be minimal. Taking into consideration that 
the final conditions will be in terms of the mathematical models, no 
additional assumptions on the physical systems will be made. This is 
in the spirit that the exact description of the physical system is unknown 
but that smoothness assumptions can be made, these being warranted 
by general physical principles. 

The assumption made is thus that the summing junctions and the 
systems in the forward and the feedback loop of the system are approxi- 
mations (in the strong topology) of the exact physical (strongly causal 
and locally continuous) system. The requirement for well-posedness 
of the closed loop system is then that it itself approximate the physical 
system obtained by considering exact physical models for the summing 
junctions and the systems in the forward and feedback loop. Thus well- 
posedness of the open-loop operators (i.e., the open-loop operators are 
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approximations of physical systems) should translate into well- 
posedness of the closed-loop system. 

A final word of caution in the interpretation of the well-posedness 
condition is necessary. The requirement is not that 

. lim (*u(tti>"s)> Kl(Mi>uJ) = (/ + GT\u Xy u 2 \ 

elO.A-K) 

but precisely that 

lim lim (Fl k {u x ^ 2 \Fl x (u x ,u z )) = (I + Gy\u u u 2 ). 

It will be remarked later on that this unfortunate situation appears to 
be the best that can be obtained. This phenomenon is not unlike similar 
difficulties encountered in stochastic systems (Ito calculus) and differ- 
ential games. 

432 Examples of Ill-Posed Feedback Systems 

A surprising fact concerning well-posedness of feedback systems is 
that the operators appearing in the loop need not be pathological to 
result in an ill-posed feedback system. A number of interesting examples 
of ill-posed feedback systems can be found in the literature (see Ref. 
1, and further references therein). Three other examples are the 
following : 

1. Let G x = —I and G z = I, Then the feedback system has no 
solutions if u x ^ w a and has multiple solutions if u x = w 2 - 

2. Assume that the forward loop consists of a delay of length T > 
minus a unit gain, that G 2 = I, and that u % = (see Figure 4.4). The 
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Figure 4.4 The Feedback System of Example 2 

error is then (uniquely) given in terms of the input by e x (t) = u x (t + T) 
for t > To and thus depends on the input in an anticipatory way. 

3. Assume that all signals in the loop are real-valued functions on 
|T ,go), that the forward loop consists of a constant gain K y that 
G 2 = 7, and that w 2 = (see Figure 4.5). If K ^ - 1 , then this feedback 
system has a unique solution e x {t) = (1 + K)~ x u x {t) for any u x . If a 
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small delay of length k > is introduced in the forward loop, then the 
error corresponding to a nonzero constant input u^t) — u Q = con- 
stant t 6 is given by 

(tin for < t < k 

Ku Q for k < t < 2k 





w 



*i(0 = 



w - Ku + K*u Q + (-K) n u for nk < t < (n + 



l)k 



Hence if k -> 0, e(0 approaches (1 + iT)" 1 ^ (the solution for k = 0) 
z/awrf om/j i/[JS:| < 1 (when the expansion (1 + K)' 1 = 2»=o ("^) n 
is valid). The result obtained in this example is characteristic of the 
conditions for well-posedness which will be obtained next. 



Figure 4.5 The Feedback System of Example 3 



4,3.3 A Condition for Well-Posedness 

Well-posedness imposes a weak condition on the operators in the 
feedback system. One such condition is given in the theorem that 
follows. First, however, a few more definitions. 

Definitions: Let F be a locally Lipschitz-continuous causal operator 
from W e into itself. Let TeS and Ar> be given. Then the real 
number 

\\P T+AT (Fx - Fy)\\ 
\\P T + AT (x - y)\\ 



PT (&T) A 



sup 

x.yeVVe 



is called the gain of F on the interval [T, T+ ATI. It is clear that 
Py(Ar) is monotone nondecreasing in AT for T fixed. The real number 
p T Ainf AT>0 p T (AT) = lim ATio /> r (Ar)is sometimes referred to as the 
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instantaneous gain of F at T. Note that p T = for a strongly causal 
operator. 

Let F be a locally causal Lipschitz continuous operator from W e 
into itself. Let TeS and AT> be given. Then there exist real 
numbers K T < 00 and M T < 00 such that 

\\(P T+AT - P T ){Fx - 150|| < K T \\{P T {pc - j0|| 

+ M T \\(P T+AT -P T )(x-y)\\ 

for all x, y e W e . The greatest lower bound of all real numbers M T 
satisfying this condition for all x y y e W e and some K T < 00 will be 
called 5 the uniform gain of F on the interval [T, T + AT*]. If Fis linear 
then its gain and its uniform gain on the interval [T, T + AT] are 
equal. The definition of uniform instantaneous gain is now obvious. 
The following theorem states sufficient conditions for well-posedness. 

Theorem 4.1 

The feedback system described by equations (FE) is well posed if 
either of the following conditions is satisfied for all TeS: 

1. The product of the uniform instantaneous gains of the operators 
G x and G 2 is less than a < 1. 

2. The system is linear and the instantaneous gain of the Nth power 
of the open-loop operator, (G^G^, is less than unity for some 
integer N > 0. 

Proof: The proof is an adaptation of the usual proofs involving 
existence and uniqueness of solutions of ordinary differential equations 
or Volterra integral equations. 6 Since the interested reader will be 

5 Whether or not this terminology is natural is unclear. The idea of instantaneous 
gain and the gain on the interval [T, T + &T] is obvious and clearly related to 
the feedthrough in a system. The uniform instantaneous gain and the uniform gain 
on the interval [T 9 T + AT] are also related to the feedthrough in a system but in 
addition take into consideration that the state of the system could be different. 

6 The type of proof given here can be completely imbedded in a contraction- 
mapping argument, thus avoiding the argumentation on consecutive intervals. It 
involves redefining the norm of Pt Q+n &px to the equivalent norm 

11 v»A^ii a i p\ Hp t0+iA T - v<*-w*ii> 

with pi appropriately chosen nonzero constants. Both arguments are tedious and 
painful. The one given here appears to be the more transparent one. The contraction- 
mapping argument is somewhat more sophisticated and yields an important side 
result: it shows that the successive approximations e n+1 = — Ge n 4- u converge on 
any interval [t Qt t + T] to the solution and it gives a (more or less) explicit bound 
on the rate of convergence and on the error. Similar ideas can be found in References 
2,3. 
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quite familiar with these arguments, let a somewhat sketchy proof 
therefore suffice. 

Consider first case 1. The feedback equations (FE) can be written 
ase = — Ge + wwithu == (u u u^),e = (e u e 2 ),SLndGe & (G % e % > — Gi^i). 
Consider the product space W x x W % and its extension W le X W %t . 

Let a x a IIPro+AtfGfl^ro+ArlL and *» — li^r^ArGi^WArlL- Let 
AT > be such that a x a 2 < 1. Such a AT > exists by the assumptions 
of the theorem. Consider now the equivalent norm on W x X W 2 
defined by 

11*11 - ||(*i,*»)|| A ft Il*^ + ft ||x.|lwv 

with oq 1 > ftft" 1 > a 2 . Such a ft and ft exist since a x a 2 < 1. A simple 
calculation now shows that P To +at gf t +at is a contraction on 
W MTQtTo+ATo) x WW 0i2V A2v ^ thus follows that the operator 
P Tq + at (I + G)P To + AT is invertible on the spaces restricted to the 
interval [T , T + AJ]. The argument is then repeated on consecutive 
intervals, and it follows from continuity that these intervals will cover 
the whole half-line |T ,oo). This argument thus yields existence and 
uniqueness of solutions and condition W.l for well-posedness. Causality 
of the inverse follows from the fact that the invertibility is based on the 
convergence of the successive approximations e n+x = Ge n -f u, n el*? 
with e Q arbitrary. This approximation technique then yields causality 
of the inverse when used together with the above construction of the 
solution. Condition WP.2 for well-posedness hence follows. Notice 
the conditions WP.l and WP.2 did not use the uniformity of the gain. 
The argument given here thus also proves Theorem 2.16. Condition 
WP.3 follows readily from this uniformity and the successive approxi- 
mations on consecutive intervals. To show condition WP.4, observe 
that the first part of this theorem yields, for e < sufficiently small, 
the existence, causality, and Lipschitz continuity of the operators F^ 
and Fl x introduced in condition WP.4, 

To show that (/ + G)' l (u x ,u 2 ) = lim €to lim^ (F' u (u x ,u 2 ) 9 F^(u u u 2 )) 
one proceeds as follows: Let — £ > be fixed and consider the operator 
G e defined by G € (e l9 e 9 ) = (-Tfi^, ff^). Let e x e W u X W 2e and con- 
sider now the successive approximations obtained by defining e n+1 — 
r-G € e n + u and e x n+1 = F(e n ), where Fis the operator on the right-hand 
side of the equation (PFE). By the strong convergence (assumption 4 
of condition WP.4) it follows that for all ne/+, lim A _> <?! n = 
e%. Moreover, lim^^^ < A = e\ and lim*.^ e% = e% and these limits 
are uniform since — e > (in fact, all series involved are really 
finite series). Thus lim A _ >0 e\ = e\ Comparing now the solutions of 
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e € = — G e e € + u and e -= — Ge + u, one observes that they also can be 
obtained by successive approximations and that the convergence is, by 
the fact that the product of the instantaneous gains is less than unity, 
uniform in e. Thus lim e | e € = e, as claimed. This procedure is then 
continued on the successive intervals. It remains to be shown that the 
operators G n F[ x and Tfi %x F\ h are strongly causal. This, however, is 
immediate since for all — € > there is a delay in the loop. This ends 
the proof for the nonlinear case. 

The refinement for linear systems follows essentially the same lines 
as the nonlinear case but with the contraction mapping principle 
replaced by Cacciopoli's corollary (Ref. 4). The same recursive 
algorithm works, and the argument only changes in detail. These 
details are left to the reader. 

43 A Discussion of the Well-Posedness Condition 

Theorem 4.1 imposes a restriction on the amount of feedthrough in 
G. Some important particular cases are given in Corollaries 4.1.1-4.1.3. 

Definition: Let J 7 be a causal operator from W le into itself. Then F is 
said to delay all inputs if for some e > the operator F € defined as 
F e x(t) A Fx(t + €) is also causal (i.e., F can be cascaded with a 
predictor and the composition remains causal). 

Corollary 4.1.1 

The feedback system described by the equations (FE) is well posed 
if the open loop operator G^Gx delays all inputs. 

Proof: This corollary follows from Theorem 4.1. 

Corollary 4.1.2 

The feedback system described by the equations (FE) is well posed 
if the operator G x is strongly causal, uniformly with respect to past 
inputs. 

Proof: This corollary follows from Theorem 4.1 and the fact that G % 
is locally Lipschitz continuous. 

An important example of a strongly causal operator is when the 
output y is related to the input u through the ordinary differential 
equation 

*(0=/(*(0.«(0,0 

y(t) = g(x(t),t) 

with t > t and x(t = t ) given, where xeB x , ueB u5 and yeB v 
with B x , B u , and B y given Banach spaces, and /and g are continuous 
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in / for t > r and Lipschitz continuous on respectively B x X B u and B x . 
It is easy to show that G defined by y A Gu is then a well-defined 
strongly causal operator, uniformly with respect to past inputs, mapping 
from Lf- into Lf/, 1 < p < oo. 

Definition: Let g be an operator from B x [r ,oo) into itself. Then the 
operator G defined by y(t) A g{u(t),t) will be called the memoryless 
operator with characteristic g. 

Corollary 4.1.3 

Let W x = L**(T Q9 co), 1 <p < oo, and G 2 G X = G' + G" be linear, 
with (?' a memoryless operator and G" strongly causal, uniformly with 
respect to past inputs. Assume that the characteristic of G', g 9 is 
continuous in / on |T 05 oo) and continuous on B. Then the feedback 
system described by the equations (FE) is well posed if g satisfies for 
some a < 1 and all t in any finite interval the inequality 



lim sup (\1AM) 



1/iV 



<a< 1. 



Conversely, if for some integer N > 1, some element u x e B u some 
t > f , and some A, |A| > 1, the characteristic satisfies g^(u u i) = Xu l9 
then the feedback system described by the equations (FE) is ill posed. 

Proof: The well-posedness part is a simple consequence of Theorem 
4.1. The ill-posedness part evolves along the lines of Example 3 of Sec- 
tion 4.3.2, Details of the proof can be found elsewhere (see Ref. 5). 

Well-posedness thus results if the spectral radius of g is less than 
unity, and ill-posedness results if g x has an eigenvalue outside on the 
unit circle for some / > t Q . In particular, if B x is a finite-dimensional 
space, then the condition that all the eigenvalues of g be inside the unit 
circle for all t > t Q becomes a necessary and sufficient condition for 
well-posedness. It can be shown, moreover, that the decomposition of 
G 2 (7i as given in Corollary 4.1.3 is canonical for linear systems. 

Remark 1: The question of well-posedness on the extended space when 
the time-interval of definition is (— oo,+ co) is much more intricate and 
is intimately related to the continuity of the feedback system. This 
aspect will be further discussed in Section 4.6. 7 

7 See also Reference 6. The author is not convinced, however, that the study of 
feedback systems with the time-interval of definition (— co, + oo) rests on physical 
grounds. 
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The result obtained in Theorem 4.1 has important implications for 
the engineer who is modeling a feedback system. Since it is not always 
desirable to model with extreme accuracy, it often happens that one 
likes to model the open loop with a feedthrough. Theorem 4.1 then 
gives very specific constraints on this feedthrough for well-posedness. 
These conditions are, moreover, all but necessary and sufficient for 
linear systems. If the conditions are not satisfied and the feedthrough 
is too large, then the feedback system will most likely be ill-posed and 
the mathematical model should be modified. This will probably most 
easily be done by taking into consideration some higher-order effects 
until the desired limitation on the feedthrough is obtained. This will of 
course generally augment the complexity of the system, in particular 
the dimension of the state space. 

Remark 2: One of the conclusions of well-posedness states that 
lim lim (F\ k {u X9 u^Fl k {u X9 u^) = (/ + G) _1 (ii ls ii a ), 

and the limits on the left-hand side are, somewhat unfortunately, to be 
taken in that specific order. An example which shows the necessity of 
this unfortunate situation is a feedback system with the gain k{t) — 1 
for r <r< T 4-A, A>0, and zero otherwise, a delay of length e 
in the forward loop, and a unit feedback. It is then clear that if A -> 
and e — ► 0, the feedback system — albeit well posed — yields a non- 
existent limit for lim e f (Fl Ji (u u u^ 9 F* x (u u u 2 )) for X > 0. Notice also 
that this difficulty remains when the inputs are assumed to be smooth. 



4.4 Stability and Instability 

From a mathematical point of view, stability is analysis: given a 
mathematical equation, one bounds certain quantities. Optimal 
control, on the other hand, is synthetic in nature: an unknown is to be 
determined in some optimal fashion. Notwithstanding these analytical 
aspects, stability theory has been very successfully used as the basis for 
the synthesis of feedback controllers. 8 In fact, essentially all the 
classical design techniques are directly inspired by stability considerations. 

Modern control theory tends to put heavy emphasis on optimal 
control and with it on general optimization theory — the derivation of 

8 Similar synthesis techniques should be developed using modern stability theory 
concepts. This has not been done as yet although it could lead to successful design 
techniques for nonlinear and/or time-varying systems. 
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necessary conditions for optimality, and solution techniques based on 
these, usually in the form of algorithms. Does this mean that stability 
theory is passe or, at best, a side issue in modern control theory? It 
appears not. Mathematical optimization theory allows only in the rarest 
circumstances for an exact mathematical design of a feedback con- 
troller, and, even if this were possible, such a controller is then almost 
never implemented, be it because too many sensors are required or 
because of some other technical difficulty. Thus, designs based on 
optimization theory usually resort to a simplified mathematical model 
and deal with sometimes crude approximations at all stages of the 
development. It should also be recognized that stability is often the main 
concern in control, rather than optimality with respect to a (sometimes 
somewhat arbitrary) mathematical performance criterion. In other 
words, the main issue in many control problems is to guarantee that a 
plant will operate in the neighborhood of a desired operating point in 
the face of disturbances. Precisely how the plant returns to this operating 
point after a disturbance is quite often of secondary importance. Such 
control problems pose two challenges to the designer : 1 , an optimization 
problem (often a nondynamical optimization to determine the most 
efficient steady-state operation, or an open-loop dynamic optimal 
control problem to determine the optimal transfer); and 2, a stability 
problem (to guarantee that the closed loop control will keep the system 
variables around their optimal values). 

Two distinct approaches to the problem of stability of systems can 
be taken. The first — and more traditional one — regards stability as 
an internal property of a system: the system is considered as excited by 
an initial condition, and boundedness or convergence of the state for 
future time is taken as the basic requirement for stability. The second 
approach — and, from a modern system theory point of view, a more 
logical one — regards stability as an input-output property: the system 
is regarded as a mapping between normed spaces, and boundedness of 
this map is taken as the basic requirement for stability. 

Input-output stability is, from an engineering point of view, a very 
significant and important type of stability. The informal definition of 
stability given for instance by Nyquist in his classic paper on "Re- 
generation Theory" (Ref. 7) is essentially that of input-output stability. 
It is intimately related to the idea of "stability under constant dis- 
turbances" and thus has some classical — although not system- 
oriented — foundations. The concept of input-output stability stands in 
direct competition with the idea of stability in the sense of Lyapunov. 
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Input-output stability considers as the disturbance entering the system 
a constantly acting input, whereas stability in the sense of Lyapunov 
considers the initial conditions as the disturbance in the system. Which 
of these two types of stability is to be preferred clearly depends on the 
particular application. In a sense, input-output stability protects 
against noise disturbances whereas Lyapunov stability protects 
against a single impulselike disturbance. 

As is apparent from the definition formally introduced in Section 
4.2, the particular state space is not relevant in the concept of input- 
output stability and usually does not enter into the development of 
stability conditions in an essential way. The simplicity of the results 
usually depends — besides on the system itself— on the input and 
output spaces. These could of course be very simple, e.g., consisting 
of scalar functions of time, even though the state space and the state 
transition process could be very complex. Many systems encountered 
in engineering applications do indeed have this property: there are few 
output sensors and few control manipulators, but the state space and 
the state transition process are very complex and governed, for example, 
by a partial differential equation. This particular aspect of input-output 
stability makes this concept particularly appealing when studying the 
stability of distributed-parameter systems and, more specifically, those 
described by partial differential equations where difficulties of a 
theoretical nature generally limit the applicability of Lyapunov-based 
methods. 

Lyapunov stability 9 considers stability as an internal property of a 
system, and inputs and outputs do not play a role. This formulation 
accounts for the early development and great historical importance of 
this type of stability. The study of systems without inputs and outputs 
is indeed basic to classical dynamics. The traditional question of the 
stability of the solar system, for example, remains a long-standing 
challenge and does not involve inputs in any way. It is thus more than 
natural that stability of control systems has been studied in this context; 
namely, as a condition on undriven classical "dynamical systems." 



9 Most of the results presented here have their analogue obtained using Lyapunov 
methods. The survey paper by Brockett (Ref. 8) contains one such method based on 
spectral factorization and ample references to other works in this area. The same 
author presents in Reference 9 another method using algebraic matrix equations 
to obtain similar results. This method was originally developed by Popov (Ref. 10) 
and Kalman (Ref. 11). A survey paper by the author (Ref. 12) contains further 
references to the literature in this area, particularly those applying to distributed 
parameter systems as well. 
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This is in spite of the fact that its founders, Lyapunov and Poincare, 
were not primarily interested in control. It should be noted that this 
dynamical-system point of view is supported by the work of Maxwell 
and much of the subsequent work on regulators. Although Lyapunov 
stability remains important and very useful in many control applica- 
tions, its basic philosophy can often be challenged and is somewhat out 
of line with the modern approach to systems, where inputs and outputs 
are the fundamental variables and the state is merely an auxiliary 
variable that essentially represents the contents of a memory bank. The 
development and success of input-output stability should thus come as 
no surprise. This does not exclude that for many applications (for 
example, in aerospace problems) Lyapunov stability does represent a 
very satisfactory type of stability, and thus its study will remain both 
important and fruitful. 

4.4.1 Discussion of the Definition of Stability 

Although the concept of input-output stability (the type of stability 
studied here) is relatively old, its actual development is of a rather 
recent date. The basic idea of this type of stability is simple: the system 
is considered as a mapping between normed spaces — the input space 
and the output space — and boundedness of this map is taken as the 
basic requirement for stability. This boundedness of the input-output 
mapping then yields a bound on the norm of the output in terms of the 
norm of the input. 

This informal definition of input-output stability uncovers on close 
examination a basic difficulty: it is not a priori clear what to take for 
the output space. More specifically, suppose that the output, y, is 
given in terms of the input, u, through the mapping y = Fu. If u 
belongs to a normed space U 9 then input-output stability roughly 
requires that y belong to a normed space 7. The difficulty is that it is nbt 
a priori clear whether the output y — Fu will even be defined for all 
ueU. This could of course be added as an additional requirement for 
stability; i.e., it could be understood that the definition of stability 
requires that u e U imply that y = Fu e Y . Very often, however, the 
output y = Fu is actually well defined for all u e U (as an element of 
some larger function space), even when the system is not input-output 
stable. This possibility of extending the basic input and output spaces is 
the underlying idea behind the introduction of extended spaces and is 
appropriate when F is causal. The introduction of extended spaces 
is the key fact which has led to the very successful application of 



STABILITY AND INSTABILITY 105 

functional methods in stability theory. Extended spaces were in fact 
first introduced in this context. 10 

The preceding remarks hold — amplified — when the input and the 
output are related through an implicit equation. Such implicit equations 
describing the input-output behavior occur specifically when the 
system is of a feedback type and are hence of more interest in stability 
analysis, since control naturally leads to a feedback configuration and 
stability problems. Recall that all the operators in the feedback system 
described by the equations (FE) are assumed to be causal. 

Causality is a fundamental property of physically realizable systems, 
since it merely expresses that past and present output values do not 
depend on future input values. There is, however, another fundamental 
reason for treating input-output stability in the context of causal 
systems only. This reason is that stability requires some type of con- 
vergence as t — ► oo and thus, unless the past and future play essentially 
different roles in the original system, such a definition, which reflects 
the future behavior only, would appear to be ill founded. 

Considering now the actual definition of stability as given in Section 
4.2, observe that it requires well-posedness and some type of bounded- 
ness of the outputs in terms of the inputs. The first condition is not 
really necessary and one could very well define and study stability 
without reference to well-posedness. Although such an approach does 
not violate any mathematical principles, two points should be kept in 
mind: 

1. Well-posedness is a much more fundamental requirement than 
stability and should always be verified anyway. 

2. When a feedback system is not well posed, it does not adequately 
describe the physical system it attempts to model. To correct this 
situation one will thus have to modify the mathematical model 
somewhat and such a modification could very well alter some funda- 
mental properties of the feedback system as, for instance, its stability 



10 The idea of introducing the concept of extended spaces is due to Sandberg 
(Ref. 13) and Zames (Ref. 14). The subsequent development of input-output 
stability is largely due to the same authors. Survey articles summarizing their results 
are References 15, 16. The last reference is particularly valuable since it bases its 
rigorous analysis on a great deal of intuitive and physical reasoning. The approach 
followed here is presented in Ref. 6. The idea of extended spaces has appeared 
implicitly in the mathematical literature in the context of ordinary and partial 
differential equations, Volterra integral equation, and delay-differential equations. 
Hopefully, this concept will eventually be formally introduced in the mathematical 
literature for its elegance in treating causal operators and continuations as the one 
involved in the proof of Theorem 4.1. 
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properties. 11 It is thus natural to make well-posedness an a priori 
requirement for stability. 

The second ingredient in the definition of stability is the requirement 
that inputs in nonextended spaces should lead to outputs and errors 
which are also in nonextended spaces. Thus "small" inputs should 
generate "small" outputs and errors. Notice also that this boundedness 
singles out the zero solution as the desired solution and ensures that a 
"small" noise driving the system will generate a correspondingly "small" 
output. It is thus the natural condition to impose on systems which are 
regulating the output around a fixed reference (taken without loss of 
generality to be zero). 

4.4.2 Conditions for Stability 

In this section some general and some specific conditions for stability 
are derived. The first theorem states a general result which is merely a 
rephrasing of definitions. 

Theorem 4.2 (Stability) 

Consider the feedback system described by the equations. (FE) and 
let G map W le x W 2e into itself according to G(e u e 2 ) = (G 2 e 2 , "" G i e J- 
Then the feedback system is stable if and only if: 

1. It is well posed. 

2. The inverse (/ + G) _1 (which exists on W l6 X W 2e by condition 1) 
maps W x X W 2 into itself. 

The system is finite gain stable if and only if in addition 

3. The inverse (/ + G)' 1 is bounded on W x x W 2 . 

Proof: The definition of stability and Theorem 2.1 lead immediately 
to this result. 

Theorem 4.2 leads to the following interesting alternate definitions 
of stability. 

Alternate definition of stability (I). The feedback system described by 
equations (FE) is finite gain stable if and only if: 

ST. I. It is well posed. 

ST.2. For any inputs w^ e W ie9 i — 1,2, there exist constants p l9 p 3 < co 
(independent of the inputs and T) such that for all TeS, \\P T ^ih 

PVill, iiiVilL 11^.11 < piWPtM + p 2 \\Ptu%1 

11 This follows, for instance, from the Nyquist Criterion and the Circle Criterion. 
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Alternate 12 definition of stability (II). The feedback system described by 
equations (FE) is finite gain stable if and only if: 

ST.l. It is well posed. 

ST.2. For any inputs u t e W ie with 



1 



sup 

T>To T - T 



\\P T Ui\\ < a), i = 1, 2, 



there exist constants p l9 p 2 < go (independent of their inputs and T) 
such that 

1 1 

sup - i|iVili> sup \\P T e 2 \\, 



T>T T - T 

1 

sup 

T>To T - T 



T>To T -T 



\\P T yi\\, sup 



T>n T - T 



IliVM 



< Pi sup — || P T u x || + p 2 sup ™ ||P r w 2 



T>To T - T 



T>T T - T 



It can be argued successfully that the alternate definition (I) reflects a 
more fundamental boundedness of the response and should hence 
(although being equivalent to the original definition) be taken as the 
basic definition of stability. Definition (I) indeed expresses a bound 
on the response in terms of the inputs but allows for a larger and a 
somewhat more realistic class of testing inputs. This appears to be an 
important point since most noise inputs fall in the latter class and 
stability thus indeed expresses some boundedness of the response to 
such inputs. 

Theorem 4.2 leads to the following theorem for stability in terms of 
the open loop operator. 



Theorem 4.3 

Consider the feedback system described by the equations (FE) and 
assume that: 

1 . It is well posed. 

2. The operator G 2 is Lipschitz continuous on W 2 . 

3. The operator G x is bounded on W x or there exists an e > such that 
for all x e W u and TeS, \\P T G 2 x\\ > e \\P T x\\. 

12 This formulation has interesting applications to problems in which the inputs 
are described in terms of random processes. 
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Then / + G 2 G X is invertible on W u and the feedback system is finite 
gain stable if and only if ||(/ + Gfi^W < go on W x . 

Proof: The operator I + G 2 G X has a causal inverse on W u by well- 
posedness. Let u x e W x and w 2 e W 2 be given, and let e x e W u be the 
corresponding error. Then e x + G 2 G x e x = w x + G^G^i — ^a(^i e i + w a )- 
From the Lipschitz condition on G 2 it thus follows that for all TeS, 
\\P T (I + G 2 G x )e x \\ < WPj.uA + IIG.IL BP^II- Since (/ + G.Gyr 1 is 
causal and bounded on W u , 

ll/Vill < II (/ + G»GJri|| 1|iP2i(/ + G.GJeJ. 

Thus UiVJ < ||(7 + G % G x yi\\ \\u x \\ + ||(7 + G 2 G X )^\\ ||GJ A ||tt,||. 
This shows that e x -e W^ and that the finite gain condition is indeed 
satisfied. Since e x = u x — j 2 , this yields immediately that y 2 also 
satisfies these conditions. It remains to be shown that e 2 and y x e JF 2 
and that their gain conditions are satisfied. If \\G X \\ < co, this 
follows immediately. If the second condition in assumption 3 is 
satisfied, then it follows that <= \\P T e 2 \\ < llPyjy, which shows that 
e 2 e W 2 and that ||e 2 || < e -1 ||j 2 ||. From e 2 e W 2 and e % = u % + j/ x , it 
then follows that y x also satisfies such conditions. The necessity part of 
the theorem is obvious by letting w 2 = and applying Theorem 4.2. 

Notice that under the assumptions of Theorem 4.3 finite gain 
stability results if and only i/ there exists a real number e > such that 
for all T e S and x e W x > \\P T (I + G 2 G x )x\\ > e \P T x\. 

It is now a simple matter to use the invertibility theorems of Chapter 
2 to obtain more specific conditions for stability. 13 These conditions are 
stated in the following corollaries. 

Corollary 4.3.1 

The feedback system described by the equations (FE) is finite gain 
stable it is well posed, if G 2 is Lipschitz continuous on W 2 , if G x is 
bounded on W x , and if the open loop is attenuating on W x (i.e., if 

II6.G1II < 1). 



Corollary 4,3.2 

Let W x = W 2 . Then the feedback system described by the equations 
(FE) is finite gain stable if it is well posed, if G 2 is Lipschitz continuous 

13 These results are essentially due to Zames (Refs. 14, 16) and Sandberg (Refs. 
13, 15). The only novel part in the Corollaries as presented here is that the operators 
G % and G x are never separated (compare the HG^All < 1 conditions with the 
1 1 ^a|| ||Gi|| < 1 condition of Ref. 16). A theorem of the positive-operator type has 
appeared in Ref. 17 in a Lyapunov setting. 
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on W 29 and if there exists a scalar c such that I + cG x has a causal 
inverse on W u such that (/ + cG^) -1 is bounded on W x and 
((? 2 - cI)G x (I+ cG^' 1 is attenuating on W x . 

Corollary 4.3.3 

Let W x = W 2 be a Hilbert space. Then the feedback system described 
by equations (FE) is finite gain stable if it is well posed, if G 2 is Lipschitz 
continuous on W 2 > and if for some real numbers a < b, b > 0, the 
operator G 2 is strictly inside the sector [a,b], I 4- \{a + b)G x has a 
causal inverse on W le , and G x satisfies one of the following conditions: 

1. a < 0, and G x is inside the sector [—1/6, — 1/a] on W x \ 

2. a > 0, and G x is outside the sector [—1/a, —l/b] on JV le ; or 

3. a = 0, and G x + (l/b)I is positive on W u . 

Corollary 4.3.4 

Let W x = W 2 be a Hilbert space. Then the feedback system described 
by equations (FE) is finite gain stable if it is well posed, if G 2 is positive 
on W 2e , and if G x is strictly positive and Lipschitz continuous on W x . 

Proof: These corollaries follow from Theorem 4.3 and Theorem 2.17, 
Theorem 2.18, Corollary 2.20.1, and Theorem 2.21 (occasionally with 
the roles of G 2 and G x reversed). The details are left to the reader. 

Remark 1: It goes without saying that the roles of G x and G 2 can be 
reversed in the preceeding theorems. These theorems also remain valid 
if the conicity or positivity conditions are stated in terms of a causal 
factorization of G 2 G X (i.e., in terms of causal operators G 2 and G{ such 
that G 2 G i= G 2 G X on W x ). 

Remark 2: Notice that Corollaries 4.3.1 and 4.3.4 are very intuitive in 
nature. Corollary 4.3.1 states the stability results if the open loop 
attenuates all signals, and Corollary 4.3.4 states that stability results if 
the feedback system can be modeled as the parallel interconnection of 
passive systems (see Section 2.8.3). 

4.4 J Conditions for Instability 

In this section a specific condition for instability is given. It is in a 
sense the converse of Corollary 4.3.2. 

Theorem 4.4 (Instability) 

Consider the feedback system described by equations (FE) and 
assume that it is well posed and that G 2 G X is Lipschitz continuous on 
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W x . Let W[ and G' be a backwards extension of W x and G 2 G X from 
5= [To* 00 ) to S' = (— QOy+oo), and assume that G' is Lipschitz 
continuous on W x and that for all lef, the operator I + G' has a 
causal inverse on W^, A{xe W[ e \ P T x = 0}. Let G{, G' % be Lipschitz 
continuous causal operators on W x and / + G' — I + G 2 G[. Then the 
feedback system is unstable if there exists a scalar c such that / + cG x 
has a noncausal Lipschitz-continuous inverse on W x and if 
(<?£ _ cI)G x {I + cGi)- 1 is a contraction on W x . 

Proof: Theorem 2.23 shows that (/+ G 2 G X )~ X (the inverse on W Xe ) 
does not map W x into itself. Thus there is at least one input u x e W x and 
w 2 = such that e x e W u — W e which thus yields instability. 

Corollary 4.4.1 

Let PF a be a Hilbert space, and let the preliminary conditions of 
Theorem 4.4 be satisfied. Then the feedback system is unstable if for 
some real numbers < a < b, the operator G 2 is incrementally strictly 
inside the sector [a,b] on W[ t I + \{a + b)G[ has a noncausal 
Lipschitz continuous inverse on W 19 and G[ is outside the sector 
[-lib, -1/a] on W x . 

Proof: This corollary follows the ideas explained in Section 2.8 in 
making the contraction condition of Theorem 4.4 more explicit. 

The above results can be improved somewhat through linearization. 
This will be discussed in detail in Chapter 7. 14 



4.5 Continuity and Discontinuity 

This section is concerned with the continuity and discontinuity of the 
feedback system. The definitions of continuity, Lipschitz continuity, 
and discontinuity have been given in Section 4.2. Much of the pre- 
liminary discussions of stability in Section 4.4 carries over to continuity. 

The study of continuity of feedback systems is not standard in the 
engineering literature. 15 It is, however, an important concept for 

14 Related instability criteria (in a much more restricted context) have first been 
demonstrated in a Lyapunov setting by Brockett and Lee (Ref. 18). In their present 
setting and generality they originate with the author (Ref. 6). 

15 Continuity of feedback systems has first been explored by Zames (Ref. 16). 
The concept as such does not appear to have had a great deal of success, although 
it makes good sense (more so than stability) as a general nonexplosiveness condition 
for tracking systems. 
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systems for which the desired output is not a priori fixed (in other 
words, for tracking systems). The feedback system should then be 
viewed as follows: An (a priori undetermined) signal drives a stable 
feedback system and generates a desired output. Additive in this input 
is an undesired "small" noise component. Continuity then requires the 
corresponding output to differ from the desired output also by a 
"small" amount. 

4.5 J Conditions for Continuity 

The first theorem on continuity is merely a rephrasing of definitions. 

Theorem 4.5 (Continuity) 

Consider the feedback system described by the equations (FE) and 
let G map W Xe x W 2e into itself according to G(e x ,e 2 ) A (G 2 e z , —G x e x ). 
Then the feedback system is continuous if and only if: 

1 . It is well posed. 

2. The inverse (/ + G)~ x (which exists on W Xe x W 2e by 1) maps 
W x x W % into itself and is continuous on W x X W 2 . 

The system is Lipschitz continuous if and only if in addition: 

3. The inverse (/ + G)" 1 is Lipschitz continuous on W x x W 2 . 

Proof: The definition of continuity and Theorem 2.1 lead immediately 
to this result. 

Theorem 4.5 leads to the following interesting alternate definitions 
of continuity. 



Alternate Definition of Continuity (I). The feedback system described 
by equations (FE) is (Lipschitz) continuous if and only if: 

C.l. It is well posed. 

C.2. Let u t g W iei i = 1, 2, be given (but arbitrary), and let e i and y t 
be the corresponding errors and outputs. Then there exists, given 
any e > 0, a d > such that the inputs u t + Ai^ with Aw^ e Wi and 
|| Ai^ || , || A w a || < e yield corresponding errors, e i + A^, and outputs, 
y i + Ay i9 with Ae l5 Ay 2 e W l9 Ae 2 , &y x eW 2i and \\Ae x \\ 9 ||Ae a ||, 
IIAjiJi, i|Aj 2 || < d (for some fixed K < oo, the inequalities hold 
with d = Ke). 
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Alternate Definition of Continuity (II). The feedback system described 
by equations (FE) is Lipschitz continuous if and only if: 

C.l. It is well posed. 

C.2. There exist constants p 1? p 2 < oo such that any w^>, w (2) e W ie , 
i = 1, 2, yield corresponding errors, e\ l \ e\ 2 \ and outputs, y\ 1] , j>j 2) 
satisfying \\P T (e™ - e< 2 >)L lli > r(4 1) - e««)||, Bi^Of > - 7i 2, )li, 
lUVOf 1 - jf Oil < Pi il^(*4 1) - «i a) )ll + Pa ll^i 11 - " 2 2) )ll f^ 
aU T e 5. - 

Alternate Definition of Continuity (III). The feedback system described 
by equations (FE) is Lipschitz continuous if and only if: 

C.l. It is well posed. 

C.2.° There exist constants p x , p 2 < oo such that for any wj 1 *, w( 2) e J7 ie 
with 



sup 



1 



PrCiij 11 - «| 2) )||< oo, i«l,2, 



t^to T — T 
the corresponding errors and outputs ej 1 *, e| 2) and j^ 1 ', i yJ a) satisfy 

sup -^— WPM" - ei M )L sup ^-i^- \\P T (e { ? - 4 2, )il, 



r^ro T — T 



sup 

T^To T — T 



r^To T — T 
IUV34 1 ' - j4")ll, sup -J— ■» '«<» - "<•* 

T>To T — T 



\\pM v - yi 2) )\\ 



< Pl sup — ±— WPM" - Oil 



, + P2 SUp 



1 



t>tq T — T 



IIF^tti" - tti")ll. 



Theorem 4.6 

Consider the feedback system described by the equations (FE) and 
assume that: 

1 . It is well posed 

2. The operator G 2 is Lipschitz continuous on W 2 

3. The operator G 1 is Lipschitz continuous on W x or there exists an 
e > such that for all x x , x 2 e W 2e and TeS, \\P T (^i ~ G 2 x 2 )\\ > 
c\\P T (x x -xJl 
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Then I + G 2 G 1 is invertible on W le and the feedback system is Lipschitz 
continuous if and only if ||(/ + G^G^ 1 ^ < oo on W x . 

The proof of Theorem 4.6 evolves completely analogously to the 
proof of Theorem 4.3 and is left to the reader. 

Notice that under the assumptions of Theorem 4.6 Lipschitz con- 
tinuity results if and only if there exists a real number e > such that 
for all Tg S and x l9 x% e W lt 

\\P T (I + G % GJxj - P T (I + G % GJx\\ > e \\P T x x - P3cc.ll. 

It is now a simple matter to use the invertibility theorems of Chapter 
2 to obtain specific conditions 18 for continuity. These conditions are 
stated in the following corollaries. 

Corollary 4.6.1 

The feedback system described by the equations (FE) is Lipschitz 
continuous if G t and G 2 are Lipschitz continuous on their domain and 
if the open loop is contracting on W x (i.e., HG^iIIa < !)• 

Corollary 4.6.2 

Let W x = W 2 . Then the feedback system described by the equations 
(FE) is Lipschitz continuous if it is well posed, if G 2 is Lipschitz 
continuous on W 2 , and if there exists a scalar c such that I + cG 2 has 
a causal inverse on W le , the inverse (I + cGj)- 1 is Lipschitz continuous 
on W x and (G 2 — cI)G x (1 + cGJ- 1 is contracting on W x . 

Corollary 4.6.3 

Let W x ~ W 2 be a Hilbert space. Then the feedback system described 
by equations (FE) is Lipschitz continuous if it is well posed, if G z is 
Lipschitz continuous on W 2> and if for some real numbers a < b, 
Z> > 0, the operator G z is incrementally strictly inside the sector 
[a 9 b] 9 1 + \(a + b)G 1 has a causal inverse on W u , and G x satisfies one 
of the following conditions : 

1. a < 0, and G x is incrementally inside the sector [— 1/b, — Ija] 
on W x \ 

2. a > 0, and G x is incrementally outside the sector [—Ija, —1/b] on 
W u \ or 

3. a = 0, and G x 4- (1/6)7 is incrementally positive on W Xe . 

18 These conditions are again essentially due to Zames (Refs. 14, 16). Compare, 
however, the H^G^Ia < 1 condition with the \\Gt\L \\GAa < 1 condition of 
Ref. 16. 
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Corollary 4.6.4 

Let W x = W z be a Hilbert space. Then the feedback system described 
by equations (FE) is Lipschitz continuous if it is well posed, if G 2 is 
incrementally positive on W 2e , and if G x is incrementally strictly 
positive and Lipschitz continuous on W v 

Proofs of these corollaries follow from Theorem 4.6 and Theorem 
2.12, Theorem 2.13, Corollary 2.15.1, and Theorem 2.16. The details 
are left to the reader. 

Corollaries 4.3.1 and 4.3.4 are again very intuitive in nature. In all 
of the theorems just cited, the roles of G x and G 2 can of course be 
interchanged or replaced by a causal factorization of G % G X . 

4.5.2 Conditions for Discontinuity 

It would be possible to give here some specific conditions for dis- 
continuity. These would, however, be identical to the instability 
conditions of Theorem 4.4 and Corollary 4.4.1. Since instability implies 
discontinuity (stability is actually continuity at the origin), these 
conditions are thus quite conservative. Sharper conditions will be given 
in Chapter 7 when discussing linearization. 

4.6 Concluding Remarks 

4.6.1 Sensitivity 

A concept related to stability is that of sensitivity. Two types of 
sensitivity are usually considered in the literature: the first is sensitivity 
with respect to measurement noise, and the second is sensitivity with 
respect Jo modeling errors. 

Consider the feedback system shown in Figure 4.6 and the open-loop 



Figure 4.6 Feedback System 
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Figure 4.7 Equivalent Open-Loop System 



system shown in Figure 4.7. The system G is called the plant, C ± is the 
forward-loop compensator, C 2 is the feedback-loop compensator, and C 
is the open-loop compensator: It is assumed that all these operators are 
well-defined causal mappings between appropriate extended spaces and 
that they satisfy the usual regularity conditions. Let W u be the input 
space and W z& be the output space. 

It is assumed that under the nominal operating conditions, n = 0, 
the feedback and the open-loop systems are equivalent. This imposes 
certain conditions on the compensators C l5 C 2 , and C. In fact, this 
restriction is equivalent to the condition that GC^I + C^GC^r 1 — GC 
on W le . 

One of the primary purposes of feedback is sensitivity reduction. 
For output noise this means that for the same noise n, y differs less 
from its nominal value when the feedback system is used than when the 
open-loop system is used. 

Definition; Let u e W u be given and let y e W 2e be the corresponding 
solution with n = 0. This output will of course be the same for the 
open-loop system as for the feedback system. Let u e W u and n e W %& 
be given and let y x e W 2e be the output of the feedback system and 
y% e W 2e be the output of the open-loop system. Then in general 
y % ^ y lm Let 6y x =y x — y and by % = y z — y = n. Then the feedback 
system is said to reduce the sensitivity with respect to output noise if it 
is well posed and if for all u x e W le , w 2 e W 2e , and T e S, \\P T dy^}] < 
\\P T dy 2 \\ = \\PT n \\' O ne can also define in the same way sensitivity 
reduction along a particular input (usually taken u = 0). The particular 
type of sensitivity reduction one wants again depends on the application. 



The relationship between sensitivity reduction and finite gain 
stability is clear: for sensitivity reduction (around u = 0) one wants 
|| J + G 2 (— Gj)\\ < 1, whereas for finite gain stability one wants 
\\I + <j 2 (— GO || < oo. Stability can thus be regarded as a property of 
a system with a finite sensitivity coefficient whereas sensitivity reduction 
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4, 6 J Doubly Infinite Time-Intervals of Definition 

Besides having dubious physical significance, it turns out that the 
case in which the time-interval of definition, S, is (— oo,+ oo), leads to 
some mathematically very severe questions of well-posedness. 

On examination of the definitions of stability and continuity as 
presented here, one observes that well-posedness of the feedback 
systems is taken as one of the conditions. This is not standard although 
it is, in the author's opinion, the most logical way to proceed. It is 
entirely possible, however, to separate the questions of well-posedness 
on one hand and stability and continuity on the other. The "advantage" 
of such an approach is that this allows one to study these two questions 
separately. This, however, is somewhat fallacious since both questions 
will eventually have to be faced and since the most logical approach is 
to examine the well-posedness question first. Well-posedness indeed 
expresses a very fundamental property which needs to be satisfied in 
order for any analysis to be well founded. Moreover, well-posedness and 
stability are not really unrelated in the sense that if a particular feedback 
system is not well posed, then the model has to be modified, and such 
a modification will in general alter some fundamental properties of the 
feedback system including, possibly, its stability properties. It suffices 
to examine the Nyquist criterion or the circle criterion with the intro- 
duction of a small delay in the loop as a convincing illustration of this 
fact. 

If one, however, treats separately the questions of well-posedness 
and stability or continuity, then it turns out that the specific conditions 
obtained in Corollaries 4.3.1, 4.3.2, 4.3.3, and 4.3.4 and Corollaries 
4.5.1, 4.5.2, 4.5.3, and 4^.5.4 are in fact still valid as conditions for 
stability or continuity. 

As far as Theorem 4.4 on instability is concerned, there is then of 
course no need for backwards extensions of the involved operators 
when S = (— oo,+ oo) and instability results with the conditions of the 
theorem or its corollary on the operators G x and G % directly. The basic 
idea behind the resulting instability theorem is essentially the following: 
Let S — (— oo,+ oo). Then the feedback system described by equations 
(FE) is discontinuous (which is then defined without involving well- 
posedness) if the operator (/ + G%G X ) has a noncausal inverse on W x . 
This in fact then leads to the following phenomenon: Assume that a 
feedback system is discontinuous with 5 = (— oo,+ oo) and that 
/ -1- G%G X is invertible on W x but that this inverse is not causal (not all 
discontinuous systems need have this property but some do, as 
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illustrated by Theorem 4.4). Then / + G % G X is not invertible on W XT A 
{xeW\P T x = 0, re (-co, + oo) given} for at least one Te 
(_co,+ co). Weak additional assumptions on G % G X (see Lemma 2.2) 
thus imply that in fact (/ + G Z G^) will not be invertible on W 1T > A 
{x e W | P T ,x = 0, Te (- oo, + oo) given} for any T e (- oo, + oo). 
Again, weak assumptions on G 2 G X (see Section 4.3) ensure that 
(/'+ G 2 G X ) will on the other hand have a causal inverse on W XT . % for 
any T e (— oo,+ co). Hence for such discontinuous systems there 
exists for any Te (— oo, + oo), an input u x e W x and u z = with 
P T u x = such that at least two solutions exist to the equations 
describing the feedback system (and thus for linear systems an infinite 
number of solutions exist). One solution yields e x eW through e x = 
(/ + GzGj)- 1 ^ with (/ + G 2 G X )~ 1 the noncausal inverse on W u and 
the other yields e x e W 1T , B9 e x $ W x , through e x = (/ + G 2 GJrhi u with 
(/ + G^G x y x the causal inverse on W XT < 6 , It is clear that the latter one 
is the "physical" solution in the dynamical-system sense and that the 
first one (which extends to — oo) is a mathematical artifact. 

A study of the above issues leads to at least one partial result con- 
cerning well-posedness of feedback systems with S= (— oo,+ oo). 
Since it is of some mathematical interest, it is given below. First* 
however, one more definition is given. 

Definition: Let S = (-co,+ co), let Wbc a Banach space, and W e be 
its extension. Assume that the family of projection operators {P T }, 
TeS, is a resolution of the identity on W\ i.e., for all xeW, 
lim^^ P T x — and lim^.^ P T x — x and let G be a causal operator 
from W e into itself. Then the feedback system described by the equation 
(/ + G)e = u is said to be continuous if for any given ueW every 
solution e e W e to the above equation actually yields e e Wand if given 
any u e Wand e > there exists a d > such that if || du]\ < e then any 
solutions e and e + Se corresponding to, respectively, u and u + du 
satisfy ||fc|| < d. 

Theorem 4.8 

Assume that G is a causal operator on W e and that (/ + G) has, for 
any T e S = (— oo,+ oo), a causal. inverse on W Te A {x e W e \ P T x = 
0, TeS given}. Then the feedback system described by the equation 
(/ + G)e = u is continuous if and only if / + G has a continuous 
causal inverse on W (and hence on W e ). 

Proof: Since the feedback system is continuous and since I + G has, 
for any TeS, a causal inverse on W Te , {I + G)~ x maps for any TeS, 
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W T A. {x e W j P T x ~ 0} into itself and is continuous on W. Since 
{Py}, re S, is a resolution of the identity, the family of subspaces 
{W T }> Te S, is dense in JF. Since (7 + G)~ x is thus continuous on a 
dense set, a continuous inverse exists on the whole space, and this 
inverse may be obtained by a continuous extension of (/ -1- GO" 1 - A 
simple argument by contradiction then shows that this inverse is 
causal as well. 

The value of the Theorem 4.8 appears to be that it gives a condition 
for well-posedness (causal invertibility of (I + G) on W e — other 
considerations are ignored here) when the time interval of definition 
S is (— oo, + oo). This condition is very strong indeed, since it requires 
continuity of the feedback system. 
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g e L le (0,co), {g n } 9 n e /+, is a real- valued sequence satisfying 

1 |g»l < co 

for all Te [0,oo), and k(t) is a real-valued function on [T ,oo) with 
k(t)€L M (T n9 oo). 

The functional equations describing the feedback system are 

e x (t) = Ul {t) - y z (t) 

« a (0 = ««(0+7i(0 

(LFE) 

j>i(0=(g?a)(0 

j 2 (0 = {G % e % )(t) 



5 The Nyquist Criterion and the 
Circle Criterion 

The results obtained in the previous chapter will now be applied to a 
particular class of linear feedback systems. They consist of a linear time- 
invariant system in the forward loop and a linear memoryless gain in 
the feedback loop. The case in which this gain is constant will be given 
special attention, and for this case necessary and sufficient conditions 
for stability will be obtained. For the time-varying case sufficient 
conditions for stability and instability will be derived. The results 
presented in this chapter are extensions of the Nyquist criterion and 
the circle criterion. 

5.1 Mathematical Description of the Feedback System 

Consider the feedback loop shown in Figure 5.1. In terms of the 
notation used in the previous chapter let the time-interval of definition 
S = [r 0) oo) 5 V x — F 2 = R, i.e., u u e ly y l9 w 2s e 25 y 2 are real-valued 
functions of time on [T Q ,oo) y and let G ± and G % be formally defined by 



(G 1 x)(0A2g«x(t-O + 



/*00 

sit- 

J To 



r)x(r) dt 



(G 2 x)(0 A k(t)x(t) 

where {t n }, n e /+, is a sequence of real numbers with t ~ 0, t n > for 
n > 1 , g(t) is a real- valued function on R with g(t) = for t < and 
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Figure 5.1 The Linear Feedback System under Consideration 

The solution space will be taken as L pa (T 09 oo) 9 1 <^ < oo, with 
^ 6 (T ,cx)) defined in the usual way. The inputs u l9 w 2 are assumed to 
be given elements of L ve (T Q ,co). 

A solution to the above feedback equations thus consists of a 
quadruple of real-valued functions on [T G ,oo) 9 {e l9 e 2i y l9 y 2 } with 
*u ^a, J>i» y2^L pe (T ,<x>) and which satisfy the feedback equations 
(LFE) for almost all t > T . 

The operators G r and G 2 have been studied in detail in Chapter 2. 
It follows readily from Minkowski's inequality and some direct 
estimates that G x and G 2 map L pe (T 09 co) into itself. They are in fact 
both locally Lipschitz-continuous linear operators on L„ 6 (T 0s od) 9 but 
for boundedness additional conditions are necessary. G x is moreover a 
time-invariant operator and G 2 is time invariant if and only if k(t) is 
almost everywhere equal to a constant. 
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The feedback system described by the equations (LFE) thus satisfies 
the assumptions W.1-W.4, G.1-G.3, and LI, and the theory developed 
in Chapter 4 is applicable. 

The values of the gain k{t) is of course what is important for the 
characterization of the operator G 2 . The operator G x has been charac- 
terized by the function g and the sequence {g n ,t n }, but it is, both from a 
mathematical and an engineering point of view, much more conveniently 
characterized by its Laplace transform. More precisely, assume that for 
some real number <r, g(t )e~ at e L x (0, co) and {g n e~ atn } e i x . Then the 
Laplace transform of G u 



G(s) A 2 g n e~ st » + P 

nei + Jo 



g(t)e~ st dt 



exists for Re s > a and is analytic in Re s > a. Let x(t) be such that for 
some real number a\ x(t)e~ a/t e L^T^co) then 

X(s)^rx(t)e~ st dt 

J To 

exists for Re s > or' and the function 
Y&aT (G x x)(t)e- St dt 

J To 

exists for Re s > a, & and in fact equals G(s)X(s). 



5.2 Well-Posedness 

The first question is that of well-posedness. A simple additional 
condition is therefore required. More precisely: 

Theorem 5.1 

Consider the feedback system described by the functional equations 
(LFE) with L ve (T ,oo)y 1 <;? < co, as the solution space. Then this 
feedback system is well posed if and only if WgJc(t)\\ Laa{TQt<x>) < 1. 
Proof: The sufficiency part of the theorem is a direct application of 
Corollary 4.1.3. Indeed, the open-loop operator is the sum of a strongly 
causal operator and a feedthrough which is precisely the time-varying 
gain gjc(t). Since the gain on L V (T ,T), 1 <p < oo, is given by 
ll&MOIIzcotTo.r) < 1l*<M0ll£.(T Q *,, sufficiency follows. 

Next, assume that the gain k(t) is replaced by the gain Jc'(t) = 
k{t) + r. Then for some \r\ < e, no matter how small e, \gak'(t)\ > 1 
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on a set on nonzero measure. Assume now that a delay of length e' ^ 
is inserted in the loop and consider as the input a unit step starting at 
T' > T , where T is such that any interval [T',T'+ a], a> 0, 
contains a set of nonzero measure, where \gjc!(i)\ > 1. It can then be 
shown that the response to this input will lead to an ill-posed situation 
on L^Tq^co), 1 < p < oo, in the sense that lim e ,_ lim € _ e e , € will not 
equal the response obtained by taking e' = e = 0. The mechanics of 
this explicit calculation are similar to those used in Corollary 4.1.3. 
The details are left to the reader. 

Theorem 5.1 shows, among other things, that the case where the 
forward loop represents a strongly causal system (g — 0) is the only 
one which should be used in connection with an unbounded gain k(t) 
in the feedback loop. 



5.3 Stability and Instability in the Time-Invariant Case 

The stability results obtained in this section essentially constitute the 
classical Nyquist criterion. 1 The case under consideration is where 
the feedback gain k(t) equals a constant almost everywhere and where 
the system is open-loop stable (the kernel of the forward-loop convolu- 
tion operator is integrable). First, however, several definitions are in- 
troduced. 

Definitions: Let LA denote the algebra consisting of elements de- 
termined by a real-valued Lx(— oo, + co)-function g, a real-valued 
^-sequence {g n }, n e/ + , and a sequence of real numbers {t n }, n e/ + . 
Addition of x x = (g u {gnJn}i) and x z = (g^Ag^Qz) is defined as 
x x + x 2 A (g x + gi,{g n ,t n }*)> wh ere the sequence {g ny t n } 3 consists of 
exactly all pairs {g n ,t n } x and {g ni t n } 2 . Multiplication by scalars is defined 
as ax = <x.(g,{g n ,t n }) A (oLg 9 {aig n9 t n }) 9 and multiplication of elements is 

1 The Nyquist criterion is well known, although many of the engineering texts 
which treat this stability problem a priori equate stability with the absence of right- 
half-plane singularities of the closed-loop frequency response. Although this is 
clearly valid for instance in the case of rational functions, this equivalence is far from 
obvious for more general transfer functions. The results stated in these texts should 
thus be carefully interpreted when applied to nonrational transfer functions since it 
is known that one indeed can get into difficulties for sufficiently complex transfer 
functions (see Ref. 1). Recently Desoer (Refs. 1-4), recognizing this difficulty, has 
obtained rigorous derivations to essentially the level of generality treated in this 
chapter (but with multiple inputs). These papers, however, treat only stability, 
whereas the results obtained in this section treat instability as well. 
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defined by 



x x x 2 A ( f + °gi(' - r)g&) dr + \ («-)i^ r " W^ 



+ 2 (g»)agi(* 



(*n)a)> {En> x 



Jsj, 



where OJs = {Ui © {*»}■ (i- e -> a11 elements of the form * tt = t ni + t n% 
where t %l and /^ range over {*Ji and {Ua respectively), and the 
element g n corresponding to t n in {g n ,t n } z is given by g ni gn^ ^ lth 
/ = t +U. Let the norm on LA be defined by \\g\\ Lt + life*} 11^. It 
can be shown that L4 as defined above is a real commutative Banach 
algebra with the unit e A (0,fe n A» where g Q = 1, / = ° and 8* = ° 
otherwise. 

Consider now the subset of L4, L^+ 5 which consist of all elements 
of LA which satisfy g(t) = for t < and f„ > for all n. It is easily 
verified that LA+ is a subalgebra and that it contains the unit. The 
details of the proofs of the above claims can be found elsewhere (Ref . 
5, pp. 141-157). . 

The Laplace transform of an element of LA is defined as the function 
of the complex variable s defined by 



r 



G(s)4lg/""+ g(t)e- si *dt. 

71=0 J— CO 

It is well defined and uniformly continuous along Re s = for elements 
of LA. It is well defined and analytic in Re s > for elements of {LA)+. 

The importance of the algebras LA and LA+ stems from the fact that 
they define subspacesof J2?(L,(-co, + co),L,(-oo,+ co)), \<p < °o: 
namely those subspaces formed by convolution operators with kernels 
consisting of an integrable function and a summable string of impulses. 
In fact, L.4 represents "almost" all time-invariant operators m&(L 9 ,L 9 ) 
and by and large all those of interest in mathematical systems theory. 
Multiplication and addition in LA correspond to those operations in 
££{L ,L V ). Moreover, LA forms a regular subalgebra of <£{L X ,L X ), 
&{LM, and &(L m ,L„) in the sense that every element of LA which 
is invertible in J^(L 2 ,L a ), for example, is invertible in LA itself. LA+ 
is of particular importance; the corresponding elements of &(L P ,Lv), 
1 < p < oo, are causal. 
The next lemma plays an essential role in the results which follow. 
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Lemma 5.1 

Let (g,{g n ,t n }) e LA+. Then (g 9 {g n ,t n }) is invertible (regular) in LA+ 
ifandonlyifinf Ee4r>0 |G(j)| > 0, and in LA if and only if inf Re s==0 \G(s)\ 
>0. 

A proof of this classical result can be found in most books treating 
convolution operators (Ref. 5, pp. 150, 155). 

Application of the above lemma leads to the following necessary and 
sufficient condition for stability of linear time-invariant feedback 
systems. 



Theorem 5.2 

Assume that the feedback system described by the functional equa- 
tions (LFE) is well-posed and that k(t) = K almost everywhere. 
Assume furthermore that {g n } e l x and g e ^(0,00) (i.e., the feedback 
system is open-loop stable). Let G(s) denote the Laplace transform of 
(g>{gn>t n })- Let L 2 (Tq,co) be the space with respect to which stability is 
defined. Then the feedback system is stable if and only if 

inf |1 +KG(s)\ >0. 

Re «>0 

Proof: It suffices by Theorem 4.3 to demonstrate that the operator 
/ + G Z G X is invertible on L 2 (T 0> cd) if and only if the conditions of the 
Theorem are satisfied. There are three mutually exclusive possibilities: 

1- inf R e^oU+^^)l>0; 

2. inf Res=0 il +KG 1 (s)\ - 0; or 

3. inf Res>0 fl + KG x (s)\ = and inf Res==0 11 + KG x (s)\ > 0. 

It needs to be shown that case 1 yields invertibility and that in cases 2 
and 3, the operator / + G 2 G X is not invertible on L 2 (T ,oo). In the first 
case, it follows from Lemma 5.1 that / -h G X G 2 has a bounded causal 
inverse on £3(7*0,00) which yields stability. Assume next that case 2 is 
satisfied; then / + G % G X multiplies the limit-in- the-mean transform of 
the element on which it operates by 1 + KG{joy) and thus the only 
candidate for the inverse is the operator which divides the limit-in-the- 
mean transform of the element on which it operates by 1 + KG(jco). 
Thus for this inverse to be bounded, (1 + KG x (ja)))~ x ought to exist 
for almost all co e R and belong to.L^. Since G x (jco) is continuous and, 
by assumption, infill + KG(j(o)\ = 0, the operator /+ G Z G X has 
in that case no bounded inverse on Z, 2 (!T , oo), which thus yields 
instability. Assume finally that case 3 is satisfied. Lemma 5.1 then 
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implies that the operator /+ Cfi'^ defined on Z, a (— co,+ oo) by the 
same formal expressions as G x and (7 a (but with the lower limit in the 
convolution integral defining G 2 replaced by — oo), has a noncausal 
inverse on L 2 (— oo,+ oo), which thus yields the noninvertibility of 
/ + G%G X by an argument similar to the one used to prove Theorem 
2.22. 



Remark 1: The above theorem is well known, although the usual 
proofs assume the equivalence of stability and the absence of singulari- 
ties of (1 + KGis)) -1 in Re s > 0, and lack therefore a certain amount 
of justification (see Ref. 1). Notice that since the system is linear, 
instability implies that there exists an input u x e L a (!T ,6o) with w 2 = 
such that e x and e 2 e L 2e (r o , oo) — L 2 (J' ,oo). 

Remark 2: If stability is defined with respect to L v (T 09 oo) t 1 < p < oo, 
then it is clear that the condition of the theorem is still sufficient for 
stability. If the condition fails because case 3 prevails, then instability 
results. These claims follow the proof of Theorem 5.2. If the condition 
fails because case 2 prevails, the situation is more complex. It can 
still be shown that then instability results at least when/? = 1 or oo, 
since J + G % G X is invertible on L P (T' ,co), p — 1, oo, if and only if it 
is invertible in LA+. 

Remark 3: It is possible to verify, at least in some cases, the condition 
infae.>o U + KG 1 (s)\ > by: 1 , establishing that inf^ |1 + KGQ<a)\ 
> 0; and 2, checking whether KG(jco) encircles the —1 + Oj point. 2 
It has not been possible as yet to completely generalize this condition 
to the case under consideration, mainly because it appears to be no easy 
matter to give a suitable generalization of the no-encirclement con- 
dition. One important particular case is stated below, namely, when 
the delays are equally spaced, i.e., when t n = wTfor some T > 0. 

Definition: The argument of 1 + KG(jto), denoted by 0(eo), with 
1 + KG(jco) t* 0, co eR, is defined as the continuous function with 
0(0) == such that for all to e R, 1 4- KG(jco) = |1 + XG 1 (/«w)|^ fl( *> 

Theorem 5.3 

The condition inf Kcs>0 1 1 + KG(s)\ > is equivalent to the following 
conditions: 

1. inf Res=0 11 + KG(s)\ > 0, and 

2. lim^^^ BiNlrrT-*) exists and is zero. 

2 For a proof of this case when^ n = for all n 5* see Reference 6. 
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Proof: Let A(s) A 2"= gn^~ snT and L(s) A So g(0^ H dt. The function 
G{jcS) — A(jco) + L(jco) is the sum of a periodic function A (yco), and a 
bounded function L(jw) that, by the Riemann-Lebesque lemma 
(Ref. 7, p. 103), approaches zero as |o>| -> 00. Since inf Res=0 |1 + 
KG(s)\ > by condition 1, it follows that inf tt£i2 11 + KA(jco)\ > 0. 
Since lim^^ OiNlirT" 1 ) exists by condition 2, it follows that the argu- 
ment O(w) of 1 + KA(ja>) satisfies 0(2tt2^ 1 ) = O(0). Thus by the 
principle of the argument (Ref. 7, p. 216), there are no zeros of the 
function R(z) — 2*= £„^ n inside the unit circle since R(z) is analytic 
inside the unit circle and since the increase in its argument as z moves 
around the unit circle equals zero. Thus the function 1 + KA(s) has 
no zeros in Re s > 0. Consider now the contour in the complex plane 
shown in Figure 5.2. The increase of the argument of 1 + KG(s) as s 
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Figure 5.2 A contour in the Complex Plane 

moves around this contour is zero for N and a sufficiently large. Indeed, 
along C x it is zero by the assumption lim^^^ OiNlnT- 1 ) = 0, and 
along C 29 C 3 , C 4 it is zero since G(s) is arbitrarily close to A(s) along 
that part of the contour. Hence, 1 + KG(s) has by the principle of 
the argument no zeros in any finite part of the half-plane Re s > 0. 
It is bounded away from zero in Re s > since it arbitrarily closely 
approximates |1 + KG(s)\ for large values of \s\ in Res > 0. Thus, 
|1 + KG(s)\ is indeed bounded away from zero in Res>0, as 
claimed. This argument of this proof is easily reversed to yield the 
converse of the theorem. 



Theorem 5.3 yields a systematic procedure for obtaining the ranges 
of the feedback gain K which yield stability and instability. This is 
illustrated in Figure 5,3 and requires verifying: 1, that the —IjK point 
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Figure 5.3 Illustrating the Nyquist Criterion 

does not belong to the Nyquist locus G(Ja>); and 2, that the Nyquist 
locus does not encircle the — IjK point. 



5.4 Stability and Instability in the Time-Varying Case 

All elements are now available to state the circle criterion 3 as applied 
to linear time-varying feedback systems. Let k min < & max be real 
numbers with & raax > 0, and let the critical disk 2, be defined as 



mtn max 




Im 




:0<k 



Figure 5.4 The Critical Disk 

3 There are several more or less independent and simultaneous sources for the 
stability part of the circle criterion. The work of Sandberg (Refs. 8, 9) andZames 
(Refs. 6, 10) has influenced attempts to obtain these results in an input-output 
stability setting. Other treatments of the circle criterion will be found in References 
11, 12. The instability part of the circle criterion originates — in a more restricted 
setting — in Brockett and Lee (Ref. 13), where it is obtained using Lyapunov 
methods. It was obtained by the author (Ref. 14) in the framework of input-output 
stability. The circle criterion as it is presented here is in essence a direct generalization 
of the Nyquist criterion and should thus be of considerable interest in engineering 
design. Note that the open loop unstable case is not treated here, in contradistinc- 
tion to Reference 13. 
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follows (see Figure 5.4): 

1. If A: max > k min > 0, then 2 denotes the closed disk centered on the 
negative real axis of the complex plane which passes through 
the points -l/fc min and -l/fc max . 

2. If k min < and £ max > 0, then 2 denotes the outside of the open 
disk centered on the negative real axis of the complex plane which 
passes through the points — l/fc min and — 1/A; max . 

3. If fc min = and fc max > 0, then Q denotes the closed half-plane 
Rej< -l/fc max - 

Let G(s) be the Laplace transform of (g 9 {g n9 t n }) anc * * et ^ + denote 
the set in the complex plane determined by ^+ A {G(s) | Re s > 0} and 
let ^° denote the set in the complex plane determined by 

&° a {G(s) | Re s = 0}. 
Let 

d(9&+) = inf|x - y\, 

and let d(3 y ^°) be similarly defined. The set ^° is sometimes called 
the Nyquist locus of G x . 

Theorem 5.4 

Assume that {g n } e l ± and g e L x (0, oo) in the feedback system 
described by the functional equations (LFE), i.e., that the feedback 
system is open-loop stable. Let G(s) denote the Laplace transform of 
(g,{g n ,t n })' Let L 2 (T , oo) be the space with respect to which stability 
is defined and assume that the feedback system is well posed. Then the 
feedback system is stable if there exist real numbers fe min =^ &W with 4 
^max > sucn tnat ^min ^ ^(0 < ^max f° r almost all t > T and if 
</(#,#+) > 0. 

Proof: It suffices to verify that the assumptions of the theorem imply 
that the conicity conditions of Corollary 4.3.3 are verified. Assume 
first that Ar min and & max are nonzero. Letfc min , < A: min and fc max , > fc max 
be real numbers such that the disk 2 1 determined by fc min , and k, 



max' 



still satisfies the condition d(<3 u &+) > 0. Such numbers fc max ' and 
k m]n , exist since j^ x is arbitrarily close to 3) for fc max > — & max and 
k min — fc min . sufficiently small but positive. It is now easily verified that 
G 2 is strictly inside the sector [k min <,k mSkX '] on L 2 (T ,oo) since k min , < 



^min ^ *-(0 ^ ^max < ^n 



Furthermore since \g Q (k mln , + fc max ,)/2| 



< 1, / + ^iC^min' + &max')/2 has by Lemma 5.1 a causal inverse on 

4 Clearly assuming £ m ax ^ ^ < l° es not con stitute any loss of generality since this 
can always be achieved by replacing C? 2 by — G 2 and G x by — G^. 

\ 
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L 2e (T , oo). The only thing that remains to be shown is that G x satisfies 
the appropriate conicity conditions. Assume first that k mia < 0. It is 
then clear that G x is inside the sector [— 1/Ar max >, — 1 /^min-l on L 2 (T ,co) 
since G x corresponds to multiplication of the limit-in-the-mean trans- 
form by G(jo>) and d{2^ Q ) > 0. Assume next that k min > 0. It then 
suffices to prove that G x is outside the sector [— 1/A: min ,, — l/£ max '] on 
L 3 ^(T ,oo). Since / -f G x (k min > + k ma ^)/2 has a bounded causal inverse 
on L 2 (r ,oo) by Lemma 5.1, it thus suffices to show (again by con- 
sidering limit-in-the-mean transforms) that 

lid + *(AW + AwW^GtyaOnz. < 2(*W - *»*)-*. 

which is indeed implied by the frequency domain condition of the 
theorem. The only case remaining is when k min = 0. This case, however, 
follows from positivity considerations. 

Remark: If k{t) has no specified upper bound but if k{i) > k min , then 
stability results if one of the following conditions is satisfied : 

1- &min > and d(2$,y+) > 0, where B denotes the closed disk 
centered on the negative real axis which passes through the origin . 
and -l/fc min ; 

2. A: min < and d(2,@+) > 0, where 2 denotes the outside of the open 
disk centered on the negative real axis which passes through the 
origin and the — l/fc min points; or 

3. k mia = and Re G(s) > e > for all Re s > 0. 

In theory it is not necessary for the third condition that k(t) be 
bounded, provided it belongs to L 006 (J' OJ co). 



Theorem 5.5 

Assume that {g„}e/i and geL^O^co) in the feedback system 
described by the functional equations (LFE), i.e., that the feedback 
system is open-loop stable. Let G(s) denote the Laplace transform of 
fg*{gnJn})> Let Li(T ,<x>) be the space with respect to which stability 
is defined and assume that the feedback system is well-posed. Then 
the feedback system is unstable if there exist real numbers k min ^ £ max 
such that < k min < k{t) < £ m ax *" or almost all t > t and if 
d{9,9*) > and d(& 9 9+) = 0. 

Proof: Let G[ and G' % denote the operators defined on L 2 (— oo,+ oo) by 
(GixXO - t g n <t - Q + T g(t - t)x(t) dr 

n=0 J— oo 
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and 



(g;*X0 = *'(0*(0 



with 



k'(t) 



KK 
k{t) 



otherwise. 

It is a simple matter to verify that L % {— ao,+ oo), G[, and G^ qualify as 
backward extensions of L 2 (T Q ,oo), G u and G 2 and that all the con- 
ditions of Corollary 4.4.1 are satisfied. The estimates involved in this 
verification are in fact identical to the ones used in Theorem 5.4, and the 
noncausality follows from Lemma 5.1. 

Remark 1: It is again possible at least in the case where all the delays 
are equally spaced, i.e., t n = nT, T > 0, to rephrase the conditions of 
Theorems 5.4 and 5.5 exclusively in terms of the frequency response 
of the forward loop. In fact d{2 9 <S+) > if and only if: 1, d(3 9 9*) > 
0; and 2, lim^^ ©(NIttT- 1 ) exists and is zero, where & is the argument 
of 1 + olG(jo)) and a is an arbitrary element of 2. This leads to the 
situation shown in Figure 5.5. 
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Figure 5.5 Illustrating the Circle Criterion 
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Remark 2: Let ^ a A {G(s) \ Re s = or}. It can be shown that it suffices 
for instability that d(3JB«) > for some a > and that d(9&+) = 0. 
This in fact leads to an improved instability criterion 5 in situations such 
as the one illustrated in Figure 5.6. 
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Figure 5.6 Illustration of Remark 2 
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6 Stability Criteria Obtained 
Using Multipliers 



6.1 Introduction 

There are two basic transformations of feedback systems which lie 
at the foundation of most of the frequency-domain stability criteria as 
they have recently appeared in the control theory literature. 

The first one is the loop transformation shown in Figure 6.1. This 
transformation results in a shift of the conicity of the operators in 
the forward and the feedback loop and can, for instance, be used to 
transform a feedback system in which the forward and the feedback 
loop satisfy certain conicity conditions into a feedback system which is 
open-loop attenuating. Corollary 4.3.3 in fact rests on this principle. 
This procedure can also result in a feedback system with positive 
operators in both the forward and the feedback loop. 

The second basic transformation is the introduction of so-called 
multipliers in the loop. 1 This is illustrated in Figure 6.2 and results in 
the possibility of exploiting certain constraints of the operators in 
the loop. Assume, for instance, that the operator in the feedback loop 
is conic and satisfies some additional conditions — for example, that it 
is time invariant and memoryless (but nonlinear) or that it is a linear 
periodic gain. In general, it is then possible to find a multiplier which 

1 The idea of using multipliers can be traced back to the work of Popov (Ref. 1). 
Other authors who developed this technique are Brockett and Willems (Ref. 2) and 
Zames (Ref. 3). 
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Figure 6.1 Transformations of the Feedback Loop 

when cascaded with this operator will not change the conicity of the 
feedback loop, whereas the compensation of this multiplier in the 
forward loop will change the conicity of the forward loop, thus making 
it possible to show stability whereas this was not possible before 
the conicities were changed by means of the multipliers. 

The above two transformations are usually used in series in the sense 
that the conicity transformation is used first and results in making 
the operator in the feedback loop (which is assumed to satisfy certain 
constraints) a positive operator. 2 The forward loop is in this process 
appropriately modified so as to preserve the original input-output 
relationship. One then introduces a multiplier in cascade with the 
operator in the feedback loop. This multiplier is appropriately chosen 
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Figure 6.2 Illustration of the Introduction of Multipliers 

2 There is no a priori reason to make these transformations in precisely this 
sequence, although some theoretical considerations supporting this sequence can 
be made (Ref. 4). 
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out of a certain class without affecting the positivity of the feedback 
loop. If the compensation of this multiplier results in a positive operator 
in the forward loop, then stability results. 

These ideas are further explored in the remainder of this chapter. 
They will be investigated first from a theoretical point of view and then 
applied to two representative cases. The first case treats feedback 
systems with a periodic gain in the feedback loop, and the second case 
treats feedback systems with a monotone nonlinearity in the feedback 
loop. 

6.2 Transformations of the Feedback Loop 

The feedback loop referred to in this section is described by the 
functional equations (FE) as introduced in Section 4.2. These are 
restated here for convenience: 



«i = «i ™ J2> 
*a = "a + yu 
Ji = G x e u 
Ja = G 2 e 2 . 



(FE) 



It is still assumed that the assumptions W.l through W.4, G.l through 
G.4, and LI are satisfied. These assumptions are enumerated in 
Section 4.2. 



Theorem 6.1 

Consider the feedback system described by equations (FE) and 
assume that W x = W 2 . Let k ^ be a scalar such that — k~ x $ a(G x ) 
in Jf+iW-L^W^ (i.e., / + kG x has a causal inverse on W Xe ). Let G[ = 
G X (I + kG x y x and G' 2 = G 2 - kl. Let u x e W u and w 2 e W 2e be given 
and assume that {e lf e % ,y l9 y 2 } is a corresponding solution. Then the 
quadruple 

{e'i - *i + ky l9 e 2 = e 2 , y[ = y l9 y' 2 = y 2 - ke 2 ) 

satisfies the equations 



<?i = u x - y 2 > 
e' % ^u 2 + y[ 9 
y x = G[e l9 
y 2 — G' % e 29 



(FE') 
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where u' x = w x — &w 2 and w^ = « a . Moreover, if the feedback systems 
described by the equations (FE) and (FE 7 ) are both well posed, then 
(finite gain) stability (continuity, Lipschitz continuity) of one implies 
and is implied by (finite gain) stability (continuity, Lipschitz continuity) 
of the other. \ 

Proof: The first part of the theorem is readily proved by direct verifica- 
tion, whereas the stability claim is a direct consequence of the relation 
between the solutions, since this relation is easily reversed from (FE') 
to (FE). 

Theorem 6.2 

Let Ne^(W le9 W le ) + (W u ,W u )) be regular in ^+(W Xe ,W u ) 
(@ + (W U9 W u )) and MeO+(W u ,W^ 0HW 2e9 W 2e )) be regular in- 
@+(W 2ei W 2e ) (&+(W 2a ,W 2e )), i.e., M, M~\ N 9 N~ x are causal bounded 
(Lipschitz-continuous) operators. Consider now the feedback system 
described by the equations 



e'i = u x - yi, 
el - «S + yl 

y'i = Ark^N-M, 

yl =NG 2 Me' 2 . 



(FE") 



Assume that both feedback systems (FE) and (FE") are well posed. Then 
the feedback system described by the equations (FE) is (finite gain) 
stable (continuous, Lipschitz continuous) if and only if the feedback 
system described by the equations (FE") is (finite gain) stable (con- 
tinuous, Lipschitz continuous). 

Proof: It is a simple matter to verify that the solutions to the feedback 
systems are related through w" = Nu l9 e[ = Ne l9 y[ = M^y Xi u % = 
M~ x u %9 e % — M~ x e %9 y\ = Ny %9 which then in view of the assumptions 
on M and N readily yields the theorem. 

Theorem 6.2 immediately leads to the following useful corollary. 

Corollary 6.2.1 \ 

Let W X =W 2 be Hilbert spaces. Let M 9 Ng &+(W u ,W u ) 
i@+(W U9 W Xe )) be regular in ^HW U ,W U ) (@+(W le ,W le )). Then the 
feedback system described by equations (FE) is finite gain stable 
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(Lipschitz continuous) if it is .well posed, if G' 2 = NG 2 M is (incre- 
mentally) positive on W 2e and if G[ — M^GxN" 1 is strictly (incre- 
mentally) positive and Lipschitz continuous on W v 

Proof: This corollary is an immediate consequence of Theorem 6.2 
and Corollary 4.3.4. 

As shown in Chapter 3, one often obtains positive operators using 
noncausal multipliers. Since such multipliers would violate the causality 
condition of Corollary 6.2.1, it becomes necessary to introduce 
factorizations. This is the subject of the following corollary. 

Corollary 6.2.2 

Let W x = W 2 be Hilbert spaces. Assume that Z e 3$(W %9 W 2 ) admits 
a factorization into Z = MN with 

1. Me&{W 29 W 2 ) is invertible, and Af*, (ikf- 1 )* e&+(W*,JVJ (i.e., 
M and M~ x are anticausal bounded linear operators); 

2. N e <%+{W 29 W 2 ) is invertible, and A^ 1 e @+(W l9 Wd\ and 

3. XG % is (incrementally) positive on W 2 and G t Z~ x is strictly 
(incrementally) positive and Lipschitz continuous on W x . 

Then the feedback system described by equations (FE) is finite gain 
stable (Lipschitz continuous) if it is, in addition, well posed. 

Proof: Since MNG 2 is positive on W 2J NG^M" 1 )* is positive on W % and 
since G X N~ X M" X is strictly positive and Lipschitz continuous on W X9 
M*G X N~ X is strictly positive and Lipschitz continuous on W x . The 
theorem then follows from Corollary 6.2. 1. 8 



6.3 A Stability Criterion for Linear Feedback Systems with a 
Periodic Gain in the Feedback Loop 

The ideas presented in Section 6.2 and the results obtained in 
Chapters 4 and 5 will now be applied to a particular class of feedback 
systems. This class consists of feedback systems with a linear time 

3 The theorems and corollaries of this section can be sharpened if one assumes 
that the preliminary conditions of Theorem 4.3 to be satisfied. Some such results are 
stated in Reference 5. The most significant relaxation is that it is then possible to 
state theorems involving a factorization of G*G X rather than requiring this factoriza- 
tion be of the type G % MM~ X G Y . 
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invariant operator in the forward loop and a periodic gain in the 
feedback loop. 4 The resulting system is shown in Figure 6.3. 




Figure 6.3 The Feedback System under Consideration in Section 6.3 

In terms of the notation used in the previous chapters, let the time- 
interval of definition S = [r o ,oo), V x = V 2 — R, i.e., u l9 e l9 y l9 w a , e 2 , 
y 2 are real-valued functions of time on [T ,oo), and let the operators G 
and K be formally defined by 



(Gx)(t)A2g n x(t-t n ) + ( 



g(t - t)x(t) dr 9 



To 

(Kx)(t) A k(t)x(t) 9 

where {t n } 9 neI+ 9 is a sequence of real numbers with t = 0, /„ > 
for n > 1 , g(t) is a real-valued function on R with g(t) = for t < 
and g e ^(0,00), {g n } e l x is a real-valued sequence, and k{t) is a real- 
valued function on [r o ,co) with k(t + T) = Jc(t) 9 for some T > and 
all / > T , with k(t) e L^J^oo). The functional equations describing 
the feedback system are: 

e,(0 = «,(0+7i(0. 
Ji(0 = (G 1 e^(t) 9 and 
y%{t) = (G % eJ(t). 



(LPFE) 



The solution space will be taken as L 2e (T 09 oo). The inputs u l9 u 2 e 
L^eiT^ °°)- The above operators have been studied in detail in Chapter 
3. It follows readily from Minkowski's inequality and some direct 
estimates that G and K map L 2e (T ,oo) into itself. They are in fact both 
Lipschitz-continuous linear operators on L 2e (T ,oo). The operator G 

4 There exists a large mathematical literature on the stability of linear differential 
equations with periodic coefficients. The specific results obtained here evolved out 
of the so-called "pole-following technique" ingeniously introduced by Bongiorno 
(Refs. 6, 7) and exploited, among others, by Sandberg (Ref. 8) and the author 
(Ref. 9.) 



142 STABILITY CRITERIA OBTAINED USING MULTIPLIERS 

is time invariant. The feedback system described by the equation 
(LPFE) thus satisfies the assumptions W.l through W.4, G.l through 
G.4, and LI, and the theory developed in Chapter 4 is applicable. 
Notice that the conditions on g, {g n }, and kit) imply open-loop stability 
of the feedback system. 

As an example of this, consider the linear time- varying ordinary 
differential equation 

p(D)x(t) + kV)q(D)x(t) = 0, D* = £ . 

df 

The following assumptions are made: 

A.l. The functions p(s) and q(s) are real polynomials in s> i.e., 

p(s) = S n + Pn-iS"- 1 + • • * + p > 

q(s) = q n s n + qn^s"- 1 + - • • + q 0f 

with p t and q i real numbers. 
A.2. The function k'{t) is a real-valued piecewise continuous function 

of t and belongs to L^. 
A.3. Either of the following conditions is satisfied: 

1. q n = 0; or 

2. q n *0Bnd-llq n $[0L,P\, 

where a and fi are such that a < k'(t) < /? for all t e i?. :' 

A real-valued continuous function x(t) is said to be a solution of 
this time- varying differential equation if it possesses (n — 1) continuous 
derivatives and if it satisfies the differential equation for all t for which 
k'(t) is continuous. Clearly x(t) = is a solution. This solution is called 
the null solution and is said to be asymptotically stable if all solutions 
approach the null solution for t -> oo. 

Asymptotic stability of the null solution of the time- varying differen- 
tial equation can be deduced from Instability of a feedback system of 
the type which is being considered in this section. Assume therefore 
that there exists a real number a such that the zeros of the polynomial 
p{s) + ccq(s) have a negative real part. 

It can be shown without much difficulty (see Ref. 9) that the 
differential equation can be rewritten as 

Pi(&)x(t) + *i(0?i(i>M0 = o> 

with p^s) a monic Hurwitz polynomial of degree n (i.e., all its zeros 
have a negative real part, and the coefficient of s n is one) with the degree 
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of p x {s) larger than the degree of q x (s). This nth order scalar differential 
equation is equivalent to the first-order vector differential equation 



dz{t) 
dt 



- Az(t) + bu(t\ 



with 

y(t) « c'z(f) 
and 

u(f) = woxo, 

where 



\ 



z(t) 



« col ( x(t) MO ^x(t) \ 



A = 








dxjt ) 
dt 

1 









1 





_-7>i.o —Pi,i ~Pl,2 * ' ' — pl,n~l_ 
b = col (0, 0, ... , 0, 1), 
c = col (f li0 , q ltU . . . , q ltnr _d, 
c\Is - A)^b = qMIPxis). 

The null solution of the differential equation under consideration 
will then be asymptotically stable if and only if given any z(0), 
lim^^ ||z(0ll exists and is zero. It is well known that the smoothness 
conditions on k'(t) are sufficient to ensure the existence of a unique 
solution which assumes the value z(0) for t — 0. Furthermore, the 
solutions satisfy the integral equation 

z(t) = e At z(0) -f^^bk^yir) dr for t > 0, 
which implies that 



y(t) 



c'e At z(0) - [y^^bk^yir) dr for t > 0. 
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It is clear that this last equation represents the feedback system of 
Figure 6.3 with T = 0, 

"i(0 = 0, «i(0 = r(t) A c V*'z(0) 



J>i(0 = y(t), 

[c'e M b for 
g(0 = 



for t > 0, 
*t(0 = X0 + K0» 

k(t) = fe t (r), and 



10 otherwise, 

g k = for all fc e J. 

It follows from the assumption on the zeros of p x (s) that all eigenvalues 
of A have a negative real part and thus that c'e At b eL (0,oo) for all 
P>1. 

Thus, proof of Instability for the feedback system implies that all 
solutions z{t) to the vector differential equation which are such that 
y eL 2e (0,oo) also belong to L 2 (0,co). Since all solutions z{t) are 
continuous, all solutions y(t) do belong to L 2e (0,oo) and hence all 
solutions yield j eL 2 (0,oo). Since 



z(t) = e M z(fy + [' e A{t ' T) bk x {r)y{ r ) dr 



and ^eLf n (0,co), A^eL^ao), e At z(0) eL 2 (0,oo) and the con- 
volution of an L r function with an L 2 -function yields an L 2 -function, 
it follows thus that z e L 2 (0,oo). Furthermore 

dz(t) 
dt 



Az(t) - k^bc'zit), 



hence dzjdt e L 2 (0,oo). Since z and dzjdt belong to Z, 2 (0,oo), lim^^ z(t) 
exists and is zero. Hence, Instability of the above feedback system 
implies asymptotic stability of the null solution of the differential 
equation. 

These simple manipulations show that although it might at first 
glance seem that the type of stability which is obtained in the theorem 
in the previous section is not as strong as Lyapunov stability, in most 
circumstances it actually implies it. 

Notice that the periodicity of k{t) was not used in the preceding 
argument. Using the periodicity of k(t) it can in fact be shown by 
invoking Floquet theory (Ref. 10) that if the system described by 
equation (LPFE) is L r stable, then it is L^-stable for any 1 < p < oo. 
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Feedback systems of the type described by the functional equations 
(LPFE) or by the ordinary differential equation occur frequently in the 
design of systems containing parametric devices. The stability proper- 
ties of such systems are of course of primary importance, and criteria 
using frequency-domain conditions similar to the Nyquist criterion 
have proven to be a particularly useful tool. Moreover, the local 
stability of a periodic solution of a nonlinear differential equation is 
equivalent to the stability of the null solution of a linear time-varying 
differential equation of the form illustrated here. 

The stability properties of the feedback system under consideration 
have received a great deal of attention in the past, and the result that is 
best known is the circle criterion, which was discussed in Chapter 5. 
Although^the circle criterion is applicable under much weaker con- 
ditions (the feedback gain need not be linear or periodic) than the ones 
stated on p. 141, it was originally proved making essentially the same 
assumptions. 

In this section a new frequency-domain stability criterion is de- 
veloped which assumes explicitly that the feedback gain is linear and 
periodic with a certain given period. This assumption makes it then 
possible to obtain an improved stability criterion. The result gives — for 
a particular transfer function of the forward loop — combinations of 
the lower bound a, the upper bound /?, and the period T of k(t) that 
yield stability. This dependence on the period is of course as expected 
and has been investigated exhaustively for certain classical types of 
second-order differential equations. The criterion to be stated in 
Theorem 6.3 requires, as do most recent frequency-domain stability 
criteria, the existence of a multiplier having certain properties. With 
the exception of the Popov criterion, however, there is generally no 
procedure offered to determine whether or not such a multiplier exists 
for a given transfer function of the forward loop (see Ref. 3). This is 
not the case for the criterion presented here, since Theorem 6.3 can be 
completely rephrased in terms of this transfer function. In fact, a simple 
graphical procedure is given to determine whether or not the multiplier 
exists. 



Theorem 6.3 

Assume that the feedback system described by the functional equa- 
tions (LPFE) is well posed, and that: \ 

1. a + e < k(t) = k(t + T)< )8 - e for some € > and almost all 
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2. inf Res>0 11 + KG(s)\ > for some Ke [a,j8], where G(s) denotes 
the Laplace transform of (g 9 {g nt t n }); and 

3. there exists a complex-valued function F(jco) of the real variable, a>, 
with F(jco) = F(—jto) such that for almost all m > 

FA. Re {F(jeo)} > € for some e > 0, 

F.2. F(jco) - FQ(© + 27TT- 1 )) e Z^, and 

\ olG(jco) + 1J 

Then the feedback system is L 2 -stable. 

Proof: It can be shown that the condition of the theorem implies that 
inf Re s>0 1 1 + JK7(s)| > for any K e [a,/?]. The remainder of the proof 
is divided into three steps. 

A. Consider the transformation of the feedback loop shown in 
Figure 6.1 and treated in Theorem 6.1 with k ~ a and the roles of 
the forward and the feedback loop reversed (i.e., the feedforward is 
taken around the feedback path). It thus suffices to prove stability for 
the system with G' = G(I + qlG)~ 1 in the forward loop and the gain 
k(t) — a in the feedback loop. This transformation is now repeated with 
the roles of the operators reversed (i.e., the feedforward is now around 
the forward path) and k = — I/O? — a). It thus suffices to prove 
stability for the system with G" = G(I + olG)' 1 + [1/(0 - a)]/ in the 
forward loop and the gain (k(t) - <x)(l - [1/(0 - a)](jfc(f) - a))" 1 
the feedback. A simple manipulation now shows that 



in 



G" 



1 I±JG 



fi — oc J + <xG 
and that the gain in the feedback loop equals 

k(t) — a 



k»(t) = (0 - a) 



P - *(0 ' 



B. The assumptions on F(jco) do not suffice to ensure that it has a 
Fourier series in l v However, since G(jco) is uniformly continuous for 
— co < a> < oo and lim,^*, G(jco) = 0, it follows that F(jw) may 
always be taken to have a Fourier series in l x , say {f n }, n e I, (in fact, 
it may be assumed that F(jco) has a finite Fourier series). 

C. Since Re F(;a>) > e >0, F^ijco) e L m . It is now a simple 
matter to verify that the operator defined on L 2 (— oo, + oo) by 
(Zx)(t) = 2*ei/n*(* — w ^) is invertible and that by Theorem 3,6 the 
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operators G ,f Z~ x and ZK" are respectively positive and strictly positive 
bounded linear operators on L % (— co, + co). The manipulations as in 
Corollary 3.2.2 and the factorization of Theorem 3.20 now show that 
the c<5nditions of Corollary 6.2.2 are satisfied, which thus yields 
stability as claimed. The fact that the operators can be considered as 
operators on Z^T^oo) follows from the causality of the operators 
after the proper factorizations have been carried out. 

Theorem 6.3 is not very useful as it stands since it leaves un- 
answered the question whether or not the multiplier F(jco) exists. This 
question can be resolved, however, and this leads to an equivalent 
formulation of the theorem. 

Let 

^max(^) = SUP (f)(0) + ^TrjT* 1 ), 

<£minO>) = inf <j>(a> + hIttT- 1 ) 

nel 

where 



<f>(o>) = arg 



mjco) + 1 



aGO'oO + 1 
The alternate formulation of Theorem 6.3 is then: 



Theorem 6.3A 

Assume that the feedback system described by the functional 
equations (LPFE) is well-posed, and that: 

1. a + e < k(t) = k(t + T) < P - € for some e > and almost all 



t>T { 



o> 



2 - inf Be s ^ I 1 + KG ( S )\ > ° for some Ke t a >£L where G(s) denotes 
the Laplace transform of (g,{g n >t n })i an ^ 

3. <AmaxO) - &nin(°>) < * f° r all \o>\ < irT~\ 

Then the feedback system is Instable. 

Proof: Since G(jco) is a uniformly continuous and bounded function of 
to, the sequence of functions G(jco + nl-irT- 1 ), nel, is equicontinuous 
and thus <£ max (a)) and <f> min (a>) are continuous functions of a). Hence, 
l^maxC^) — ^minC^OI is a continuous function of co. Since by symmetry 
&*ax and <£ min are periodic, \ 

^max(^) ~ ^min(^) = &naxO + ^T^ 1 ) - <f> min (co + 2ttT~^ 



148 STABILITY CRITERIA OBTAINED USING MULTIPLIERS 

Since |^ max (o>) — <f> min (co)\ < n, there exists an e > such that 

l^max( w ) - <AminO)i < * ™ € - L et 

F(jai) = exp {-i/[^ max (w) + ^ min (w)]}. 

It is easily verified that this choice for F(jco) yields the conclusion by 
Theorem 6.3. For the converse part of the equivalence, assume that 
&nax<V) — <£min(V) = ^ for some <*>' e &- Then, since Re {G(ja))F(jco)} 
has to be nonnegative for all to, this implies that |arg F(ja)')\ > tt/2, 
which contradicts the condition that Re F{jco) > e > 0. 

The following two corollaries show that the criterion is a trade-off 
between the circle criterion (T arbitrary) and the "frozen time" Nyquist 
criterion (T small). 



Corollary 6.3.1 

The feedback described by the equations (LPFE) is L 2 -stable if it is 
well posed, if k(t) is periodic, and if: 

1. a + e < k(t) < /J — e for some e > and almost all t > T ; 
2 - i nf ke s >o I 1 + KG(s)\ > for some K e [<x,j8] where G(j) denotes the 
Laplace transform of (g 9 {g n9 t n }); and 

W») + 1 > 



3. Re 



*G(ja>) + 1 



Proof: Take J^O'co) — 1 and apply Theorem 6.3. 

Corollary 6.3.1 is a particular case (since it assumes the feedback 
gain to be linear and periodic) of the circle criterion. 

Consider now the stability properties of the linear time-invariant 
system obtained by replacing k{t) in the feedback loop by k t = k(t) 
for some r. Even though the time-invariant system thus obtained is 
Z^-stable for all constants k u it does not follow in general that the 
original feedback system is L 2 -stable (see Ref. 1 1). This fact is closely 
related to the Aizerman conjecture for time-invariant systems to be 
discussed in the next chapter. However, the following corollary shows 
that this procedure is legitimate if the period Tis sufficiently small. The 
corollary states that if the frequency of the feedback gain is sufficiently 
high compared to the natural frequencies of the forward loop then no 
instability due to "pumping" can occur. 5 

5 This result is not the sharpest one possible. Indeed, it can be shown (Ref. 10) 
that for the period sufficiently small, it suffices that the constant feedback system be 
stable when k(t) is replaced by its average value. 
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Corollary 6.3.2 

Assume, in the definition of G, that g n = for all n > 1 and that 
the feedback system is Instable for any k(t) = k = constant in the 
feedback loop with a < k < /?. Then there exists a T x such that for all 
T < T x the feedback system with any gain a < kit) — k(t + T) < /? 
in the feedback loop is also L a -stable. 

Proof: Since lim^^ G(jto) = g Q exists (by the Riemann-Lebesgue 
lemma) and is real, lim^-^ <f>(<o) exists and is zero. Since the feedback 
system is Instable for constant gains k in the feedback loop with 
a < k < ft, there exists a function of Z(jo>) such that for all ct>, 



Re Z(jco) > e > and 



J olG(jco) + 1^ 



(This follows from the Nyquist diagram and a simple graphical con- 
struction.) It thus follows that for a> sufficiently large the function 
jp(7o>) = Zijcoi) for |o>| < co I2 and F(jco) = F(j(co + co )) otherwise, 
will yield the conclusion by Theorems 6.3. 

93 A Application of the Criterion 

Theorem 6.3A suggests an obvious graphical procedure for de- 
termining whether or not Theorem 6.3 predicts Z 2 - stability. This is 
illustrated in Figure 6.4, and requires plotting the curves </> N (Q.) — 
<f>(Q, + Nco ), a) = InT- 1 , versus £i for |Q| < co /2 and Nel. The 




-7T/2 \ 

Figure 6.4 Graphical Procedure for Determining F(jco) 
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upper and lower envelope of these curves give <£ max (Q) and ^^(H). 
Theorem 6.3 then requires for Instability that ^ max (0) — ^ min (Q) < -n 
for all \£l\ < co /2. It is apparent that this procedure, although straight- 
forward, is rather tedious. 

In order to facilitate the application of the criterion, some simple 
necessary conditions for the multiplier F(s) to exist are now given for 
the case < a < /?: 

1. The Nyquist locus of G(s) should not encircle or intersect the straight 
line segment [— 1/a, — 1//8] of the negative real axis of the Nyquist 
plane. 

2. The points G(yweo /2), n = 0, 1, 2, . . . , should satisfy the conditions 
of the circle criterion; i.e., they should not lie inside the closed disk 
centered on the negative real axis at — J(l/a + 1//?) with radius 
*d/« - 1/0. 

Analogous conditions hold for other ranges of a and fi. 
The second necessary condition follows from the fact that, since 
F(jco) = F(— joy), and since F(j(a> + co )) = F(jto), then 

F(jnco /2) = Re F(jnco /2) 



for« = 0, ±1, ±2,.. 
imply that 



Thus conditions F.l and F.3 of Theorem 6.3 



RcW na> /2)4-l >() 
aG(jnft> /2) + 1 



for 



nel, 



which leads to the second necessary condition. 

By choosing particular functions for F(jcd) it is of course possible to 
obtain other sufficient conditions for Instability. The next corollary is 
based on this idea and gives a quite simple sufficient condition for the 
multiplier F(jco) to exist. It is expressed entirely in terms of the Nyquist 
locus of G(s), and is stated here for the case < a < /?. 

Corollary 6.3.3 

Assume, in the definitions of G, that t n = nT x for some T x > 0. Then 
the feedback system described by equations (LPFE) is Instable if it is 
well posed, and if: 

1. the Nyquist locus of G(s) does not encircle the point —1/a on the 
negative real axis of the Nyquist plane; and 



i 
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2. there exists a circle, C, which passes through the points —1/a and 
— 1//?, such that the Nyquist locus of G(s) for ay > does not 
intersect it. 

Let C be the mirror image of C with respect to the real axis, and 
consider the following two parts of the Nyquist locus of G(s) : 

S x : {G(jco) [ mo, < co < (n + l/2)a> }, and 

S a : {G(jco)\ (n + l/2)o> <<»<(« + l)co }, 

where n e /+ Then 

3. C does not intersect both S x and S 2 , 
This corollary is illustrated in Figure 6.5. 



ImG(ja;) 




>o /2 



Figure 6.5 Illustration of Corollary 3.3 

Proof: Condition 1 assures that the third condition of Theorem 6.3 is 
satisfied. Let |0| < tt/2 be the angle between the positive real axis and 
the straight line through the origin of the complex plane defined by the 
points 



ftr+1 

OCT + 1 



eC 



\. 



Assume that 6 > and that C does not intersect S % (a similar argument 
establishes the corollary for the other cases). Let F(jco) be a function 
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of a) such that fotnel 



argF(»-{-(7r/2-e) 




for nco < co < (n + I/2)a> 
for (n — l/2)co < co < «co . 
for co = nft> , (« + l/2)ct> 



Clearly, F(/a>) satisfies conditions F.l and F.2 of Theorem 6.3. 
From condition 1 of the corollary it follows that 

— 7T + 6 < <f>(o)) < e 

for co > and na) Q < co < (« + l/2)co ; and from the fact that C does 
not intersect S z it follows that 

-0 < <£(ft>) < 7T - 

for o> > and (n — l/2)to < co < «co . Thus it follows that 
-tt/2 < arg F(;co) + ^(co) < tt/2 

for co > 0, which establishes condition F.3 of Theorems 6.3 since 
arg F(-jca) + <£(-co) = - arg F(/co) - </>(co). 

The following two examples illustrate the usefulness of Theorem 6.3 
and its corollaries. 
1. Let 

G(s) ~ . 

(s + 10)(s 2 + 0.4s + 1) 

k{t) « k(t + T) and < k(t) < 2. Determine for which range of 



ImG(jcu) 
0.1 



ReG(jo u) 




Figure 6.6 Nyquist Locus of sj(s + 10)(j 2 + OAs + 1) 
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eo = 27r/r this feedback system is stable. The Nyquist locus of G(s) is 
shown in Figure 6.6. 

It is apparent from the Nyquist locus that the circle criterion cannot 
be used to predict Instability. Using the procedure suggested at the 
beginning of this section, Theorem 6.3 shows that this feedback system 
is L 2 -stable for all k(t) in the determined range provided co > 1.55. 
Using Corollary 6.3.3, on the other hand, this feedback system is found 
to be Instable for all k(t) in the given range provided eo > co r = 3.3. 
(This number co r was obtained as follows: Let AB be the tangent to 
the Nyquist locus through the point (— 1/2 + 0/); let A C be the line 
symmetric to AB with respect to the real axis. The intersection of the 
Nyquist locus and AC then gives co r /2.) 

This example shows that, although Corollary 6.3.3 did not give an 
excellent estimate, it is quite simple to apply. 

2. Let G(s) = l/s(s + 2). Determine K(co Q ) such that the feedback 
system is L 2 -stable for all k{t) = k(t + T), co Q — 2tt/T, and < e < 
k(t) < K(co Q ). The Nyquist locus of G(s) is shown in Figure 6.7. Using 
the circle criterion, one obtains K(co ) = 4. Brockett (Ref. 11) has 
shown by examining the worst possible variation in k(t) that K(co ) = 
11.6. Applying Theorem 6.3 and the graphical procedure outlined in 
this section results in K(co ) as shown in Figure 6.8. The same figure 
also shows the result obtained using Corollary 6.3.3 and a graphical 
construction analogous to the one used in example 1. Thus, by re- 
stricting the feedback gain to the periodic, it was possible by means of 
Theorem 6.3 to obtain higher values of K&s the frequency was increased. 







ImG(jcu) 




0.1 


-0.25 


O ReG(ju>) 
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Figure 6.7 Nyquist Locus of l/s(s + 2) 
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' Theorem 3 

Corollary 3 
Ref.ll 




2 4 6 

Figure 6.8 Regions of Stability for Example 2 

Remark: It follows from example 2 that the converse of Theorems 6.3 
is false; i.e., if F(jco) does not exist, then there will in general not 
necessarily be a k(t) in the required range such that the feedback 
system is not Instable. 

Theorem 6.3 has, as is to be expected, an instability converse. It is 
stated in Theorem 6.4 and can be proved using the methods developed 
in Chapter 4 and the inequalities introduced in this section. 

Theorem 6.4 

Assume that the feedback system by the functional equations 
(LPFE) is well posed, and that: 

1. a + € < k{t) = k(t + T) < £ - e for some e > and almost all 
t> T ; 

2. inf Re ^ 11 + KG(s)\ = and inf ResM) |1 + KG(s)\ > for some 
Ke [a,/?] where G(s) denotes the Laplace transform of (g,{g n ,t n }); 
and 

3. there exists a complex-valued function F(jco) = F(—j<o), of the 
real variable co, such that for almost all a> > 0: 

FA. Re {F(ja))} > € for some e > 0, 

F.2. F(jco) - F(j(a> + lir^)) e L^ and 

jgG(jco) + 1) 



Then the feedback system is L 2 -unstable. 
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6.4 A Stability Criterion for Feedback Systems with a Monotone 
or an Odd-Monotone Nonlinearity in the Feedback Loop 

As a second class of stability criteria for feedback systems derived 
by means of multipliers, consider the system with a time-invariant 
operator G, in the forward loop and a monotone or an odd-monotone 
nonlinearity in the feedback loop. For convenience and in order to 
emphasize the generality of the approaches outlined in Chapters 2 and 4, 
the results will be derived for systems described by difference equations. 
With some modifications similar results can be obtained for the 
continuous case. The adjustments in the theorems involve choosing 
the integers as the set S and making some minor technical changes. The 
feedback system which will be considered is shown in Figure 6.9. 



K) 



*D 



w 



h) 



W 






K>1 



M 



Figure 6.9 The Feedback Loop under Consideration in Section 6.4 
Definitions: The operators G and F are formally defined by 



G({**})* = 2 8*iXi> k e /+ 



and 



i<~r 



n{x k }) k =f(x k ) 9 kel + , 
where it is assumed that: 

1. G e.2?+(/ a ,/ a ), i.e., that G maps / 2 into itself and that g kl = when- 
ever k < /; and 

2. /is a mapping from R into itself fir which there exists a k such that 
|/(cr)| < K\a\ for all a e R. 

It is simple to verify that under these conditions G and F map / 2e into 
itself and that they are bounded and causal. The system is thus again 
assumed to be open-loop stable. 
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The equation describing the forward loop of the feedback system 
is thus 

l*I + 

The array {g kl } is often referred to as the weighting pattern of the 
system. This system is slightly more general than the input-output 
relation governed by the ^-dimensional difference equation 

X *H-1 ~ ^pCfc + b k u k9 

y k ^c f k x k + d k u k , kel\ 
x = given, 

where b k and c k are ^-vectors, d k is a scalar, A k is an (n x n) matrix and 
x k is an n-vector called the state of the system. This input-output 
relation is a particular case of the input-output relation defined by the 
above summation with 



gm = 



\c , k A k _ 1 , . . A l+1 b z for k > J + 2 

c k b k for k = I + 1 

d k for k = I 

kO otherwise, 



r k = c 'k A k-i • • • A o x o for k > 1, 



and 



r — ^o-^o* 

The case in which the system is time-invariant is of particular 
interest. The system is then defined by the equation 

where g fc is assumed to be zero for k < 0. This system is slightly more 
general than the input-output relation governed by the ^-dimensional 
difference equation 

x k+1 — Ax k + bu k9 
yic = c'Xk + d, kel+, 

where b and c are constant w-vectors, d is a scalar constant, A is a 
constant (« x n) matrix and x k is an w-vector called the state of the 
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system. This input-output relation is a particular case of the input- 
output relation defined by the above summation with 

g k = c'A^b for k > 0, 

£o = d 

g k = for * < 0, 

r & — c'y4 fc x for k > 0. 

The equation describing the feedback loop is 

and the closed-loop equation of motion becomes 



JeJ" 



ieZ 



It will be assumed that this equation is well posed. A sufficient condition 
for well-posedness is that g kk = for all fc e /+. 

The feedback system under consideration is said to be Instable if for 
all / 2 -sequences r — {r k } and v = {v k }, all solutions {y k } belong to / 2 
and satisfy the inequality 

(lyl) <Pi(lvl] + P Jzrl) 

for some constants p x and /> 2 - 
Remark: Notice that / 2 -stability implies that 
lim y k = lim f{y k ) = 0, 

jfc-*oo ft-*co 

and that for the w-dimensional difference equation described above it 
implies that if v k = for all k then lima, -o su P&e/+ IjxJ = °> which in 
turn implies asymptotic stability in the sense of Lyapunov provided the 
system is uniformly observable. 

Notation and Definitions: The operator isis said to be monotone (or 
odd-monotone) if f(a) is a monotone (or an odd-monotone) function 
of c F is said to be strictly monotone (or strictly odd-monotone) if 
f(o) — ecr is a monotone (or an odd-monotone) function of a for some 

€>0. 

Application of the principles described in the beginning of this 
chapter and the positive operators introduced in Section 2.5 lead to the 
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following stability theorem. 6 The reader is referred to that section for 
the nomenclature. 



Theorem 6.5 

A sufficient condition for the feedback system under consideration 
to be / a -stable is that: 

1. G belongs to -2? (4,4) an d F is strictly monotone (strictly odd- 
monotone), and bounded; and 

2. there exists an element Z of JS?(/ 2s / 2 ), such that Z — e/is doubly 
hyperdominant (doubly dominant) for some e > and such that 
ZG is positive on / a . 

Proof; This theorem is a straightforward application of Corollary 
6.2.2 if it can be shown that Z can be factored as required there. This is, 
however, precisely what is stated in Corollary 3.18.1. 

The case in which the system is time invariant and the multiplier is 
of the Toeplitz type is, of course, of particular interest. The positivity 
condition and the doubly hyperdominance (doubly dominance) con- 
dition can then be stated in terms of ^-transforms. This is done in the 
Corollary 6.5.I. 7 

Lemma 6.1 

Let R == {r fc _J, k, 1 e /+ define an element of jSf (/ a ,/ 2 ) which is of the 
Toeplitz type. Then a necessary and sufficient condition for the inner 
product (x,Rx) to be nonnegative for all / r summable sequences x is 
that the z-transform of {rj, R(z) 9 satisfy Re {R(z)} > for almost all z 
with \z\ = 1. 

Proof: It is well known that 

<jc,«x> It = ~ & R(z) \X(z)\* z- 1 dz 
2tt Jul— i 



= — [* R{e j »)\X(e jio )\*da> 
= — \*YL&R(e'^\X{e'")\*d<x> 9 



and the conclusion follows. 

6 This result is a generalization of similar results obtained by O'Shea et aL (Refs. 
12-14) and Zames and Falb (Ref. 15). 

7 Corollary 6.5.1 is the result of Reference 14. 
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Corollary 6.5.1 

A sufficient condition for the feedback system under consideration 
to be 4-stableis that: 

1. G is a Toeplitz-type element of ^(/ 2 ,/a), and Fh strictly monotone 
(strictly odd-monotone) and bounded; and 

2. there exists a Z(z) such that Z(z) — e is the z-transform of a hyper- 
dominant (dominant) sequence for some e > and such that 
Re {G(z)Z(z)} > for almost all z with \z\ = 1. 

Proof; This corollary follows from Theorem 6.5 and Lemma 6.1. 

Remark; For the /i-dimensional time-invariant difference equation 
introduced on page 156, the operator G will belong to ^(/ a ,/ 2 ) if all 
eigenvalues of A have magnitude less than unity. 
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7 Linearization and Stability 



Linear systems are much simpler to design and analyze than nonlinear 
systems. This is the reason why engineers resort to linear models if this 
is at all possible and that otherwise a nonlinear system is very often 
linearized for design purposes. This chapter examines the relationships 
between stability and continuity of nonlinear systems and their 
linearizations. It will be shown that a nonlinear system is Lipschitz 
continuous if and only if its linearization at any point is continuous and 
that a system whose linearization at the origin is not continuous is not 
finite-gain stable. 

The second part of this chapter contains an account of some of the 
linearization procedures that are frequently used in engineering design. 
These include the describing function linearization (often called the 
equivalent gain or the method of the first harmonic), the total gain 
linearization, and the incremental gain linearization. An attempt will 
be made to characterize the philosophy of these methods and to draw 
attention to their deficiencies. To illustrate this point, the chapter thus 
ends with a class of counterexamples to Aizerman's conjecture. 

7.1 Linearization 



The reader is referred to Chapter 2 for the nomenclature and the 
notation used in this section. Particularly the definitions introduced in 
Sections 2.4 and 2.6 are freely used. 
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LINEARIZATION AND STABILITY 



Definition: Let W x and W % be Banach spaces and let F be a (in general) 
nonlinear operator from W x into W % . Let x Q e W x and assume that 
there exists a bounded linear operator L x eJ?(W x ,W z ) such that 



lira 

ll»H-o 



\\F (xp + x) - Fx - I^x i| ^ 2 



11*1 



= 0. 



Wi 



Then Z^ will be called the linearization of P at x . 

It is in general not clear whether or not a particular operator admits 
a linearization at a given point. This, as differentiation, indeed requires 
some smoothness on F. The linearization L_ is well defined whenever 
it exists (i.e., it is unique) and preserves some of the properties of F 
(e.g. , its causality). 

Some elementary properties of linearizations are: 

1. If F admits a linearization at x , then Pis Lipschitz continuous at x . 

2. Let Pi and P a be operators from W x into W % which admit lineariza- 
tions, L Xx and L^ , at x e W x and let a be a scalar. Then F x -f P 2 
and olF x admit linearizations at x . These are, respectively, L laj + 
L^ and aZ, lv 

3. Let F x and P 2 be operators from, respectively, W x into W % and JF 2 
into PT 3 which admit linearization L Xx and L 2F x at, respectively, * 
and F x x Qt Then L 2jF a L la . is the linearization of F 2 F X at x . 

4. If P e JSf (H^, W z ) then L B ° exists for'all x e W x and L x = F. : 

Less elementary but very essential is the following property of 
linearizations. W x and W z are from now on assumed to satisfy the 
axioms which warrant for causality properties and extensions. 

Theorem 7.1 

Let Pbe an operator from W x into W 2 and let x e W x . Assume that 
F admits the linearization L x at x . Then L x is causal (strongly casual, 
anticausal, memoryless) on W x if F is causal (strongly causal, anti- 
causal, memory less) on W x * 

Proof: Let F be causal and assume, to the contrary, that L x is not. 
There then exists an x e W x and T e S (the time-interval of definition) 
such that P T L X x^O and P T x — 0. Then 

lim H p y^ x Q + a *) ~ Fx * ~ <*- L x«A\\w* ^ Q 
«-*o lalllxH^ 
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since 

l|P y [P(*o + <**) - Fx o - *L*x]\\w* 

< \\F(x Q + ax) - Fx - cdL x x\\ w% . 
However, 

P r [P(;c + <**) - ^o] = P T [FP T (xo + ax) - JP^J 
= P T (FP T x — PPyXo) 

= 

implies that 

^ \\P tL X0 x\\ W 2 _ l1PrIW*llTT, 



lim 

a r2 \\ x Wwx 



II^IItfi 



which yields the desired contradiction. The other cases are proved in a 
similar fashion. 

One of the main points of this monograph is the importance of 
causality of operators and the resulting possibility of analyzing systems 
on extended spaces. The question thus arises whether or not the 
definition of linearization can be extended to cover causal operators on 
extended spaces. This is indeed possible. 

Definition: Let W x and W % be Banach spaces defined on the time- 
interval of definition S and let W u and W 2e denote their extensions (it 
is thus assumed that W le and W 2e satisfy the axioms of extended spaces). 
Let F be a causal operator from W le into W u . Then the operator L Xo 
from W u into W 2e is said to be a linearization of F at x if it is linear and 
if for all TeS, P T L Xo is a linearization of P T F at P T x . It is easily 
verified that L Xo exists for any causal operator which has a linearization 
at P T x for all* Te 5. This follows from Theorem 7.1 and by defining 

JT rrtlLi, 



T^Xq 



to be the linearization of P T F at P T x . The resulting linearized 
operator L Xo is then defined on the extended space by (P T L x x)(t) — 
(L XQtT x)(t) for / < T, * e S, with L XQtT the linearization of P T F at 
P T x . Clearly this provides an alternative characterization of L Xo . 
Linearizations on extended spaces are again well defined whenever they 
exist and satisfy the elementary properties 1-4. The following theorem 
is now obvious. 



Theorem 7.2 

Let P be a causal operator from W l0 into W 2e and let x Q e W Xe . 
Assume that F admits a linearization L Xo at x . Then L Xo is causal 
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(strongly causal, memoryless) on W u if F is causal (strongly causal, 
memoryless) on W u . 

The following examples serve to convince the reader that the 
functional linearization considered here specializes to the more familiar 
concepts of linearization as encountered in the theory of functions on 
Euclidean spaces and in the theory of ordinary differential equations. 

1. Let /: R n x S-+R m be differentiate for all asR n and for 
almost all t e S and assume th^at for some K < oo, \\f(a l9 t) — 
/(*a.OII < Jn°i ~ Call f^ all a l9 a % eR n and almost all t e S. Let 
{Fx){t) J±f(x(t) 9 t). It is clear that F is a well-defined, memoryless 
operator from Lf*(S) into Lf™(S) for 1 < p < oo and that F admits a 
linearization for all x e Lf". In fact, (F XQ x)(t) = B £(x (t) 9 t)x(t) 9 where 
d £(cf,t) is the {m x m) Jacobian matrix (the (i\j)th entry of 9//3<r equals 

§f,(*,0/9*y). 

2. Consider the nth order ordinary differential equation 

x = f{x 9 u 9 t) 9 

y = g(x 9 u,t), 

with S = [To, oo), m e it™, x e R n , y e jR & , and x(* ) e £ n given. Assume 
appropriate smoothness conditions on /and g (e.g.,/, g uniformly 
Lipschitz and differentiable). Let F be defined by y A Fu 9 where u 
generates y through the differential equation. Then F maps L R ™(S) 
into Lf*(S) 9 1 < p < oo, is causal on L^(S) 9 and F admits a lineariza- 
tion for any w Q e Lf™(S) with L Wfl defined through the linear ordinary 
differential equation 

Ax = •— (x 09 u 09 t) Ax + ^ (x 09 u 09 i)u 9 
ox ou 

y = r^ (x ,u ,0 Ax + -£ (x 09 « ,0«, 

9x ou 

with Ax(/ ) = 0, w given, and x the corresponding solution. 



The above examples demonstrate that time invariance of operators is 
not preserved under linearization. If, however, the element x e W u is 
itself constant for all t e S 9 if S = (— oo , + oo), and if Fis time invariant, 
then it follows readily that L Xo will also be time invariant whenever it 
exists. 
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Theorem 7.3 

Let Fe S§(W 9 W) be regular in $(W 9 W) and assume that F has a 
linearization L Xo at x Q e W. Then L" 1 exists and is the linearization of 
F" 1 at Fx . 

Proof that L Xo is one-to-one: Assume to the contrary that L x x = for 
some x e W 9 x # 0. Then 



lim ll*t*o + «) - *koll 



a->0 
a#0 



Ml 



=o, 



which shows that 



l|F(x + ax) - Fxpll 
;^o \\F~\F{x + ax)) - i^^Xol 

a#0 



lim 



= 0, 



and that F" 1 is not Lipschitz continuous at x . 

Proof that L Xo is onto: Let F be defined as F x — F(x + x ) — Fx and 
let F~ l be its inverse. This inverse exists since F is by assumption 
invertible. Consider now the equation x = Hx A x — L^Fp-x + y. It 
follows from the definition of a linearization that 



*i — X 2 



I*-^ 1 *! ~ i^ 1 *,)!! 







as 



*«ll 



0. 



Il^xi ~ *T*>II 
In particular since F" 1 is Lipschitz continuous, 

II* ™ *a ™ 4^*1 ~ F o lx ^\ < K ll*i ~ *»ll 

with K < 1 for H^ — x 2 || sufficiently small. This shows that H is a 
contraction on some sphere around the origin. A similar calculation 
yields that for y sufficient small H maps this sphere into itself 1 and thus 
has a fixed point. Hence the equation L^^x = y has a solution for j; 
sufficiently small which by linearity of L Xq shows that L x is indeed onto 
as claimed. 

Proof that L" 1 is bounded and that it linearizes F _1 at Fx : The inverse 
L" 1 is bounded by the closed graph theorem. To see that L~ l linearizes 
F~ x at Fx note that since 

HF-^Fxo + L Xo x) - F~ l Fx Q - x|| 

< IIF-1 ||Fx + L Xo x - F(x + x)||, 

1 The method of proof suggested here follows the usual proofs of the implicit 
function theorem (see, e.g., Ref. 1, p. 47). 
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it follows that 



lim 



IIJT^Xo + V) - F-'Fx, - x|| 



ll*ll-o 
which shows that 

J7-V 



11*11 



= 0, 



\ \F-\Fx + z) - F-*Fx - L^z\\ 
lim - — = 0. 



Hi5i«H-« 



\\L^z\\ 



Since ||L-^|| > \\L x y \\z\\ 9 it follows that 
\\F-\Fx, + z) - F~ x i% - L^zll 



lim 



= 0. 



l-o 



Theorem 7.3 and the definition of linearization on extended spaces 
combine to give 

Theorem 7.4 

Let F be a causal locally Lipschitz continuous operator from W e into 
itself with a causal locally Lipschitz continuous inverse on W e . Assume 
that F admits the linearization L x at x Q . Then L x is invertible, L Xq and 
L" 1 are causal and locally Lipschitz continuous on W eJ and F~ x admits 
the linearization L" 1 at Fx Q . 



7.2. Linearization, Stability, and Continuity 

In this section the fundamental relationships between stability and 
continuity of a nonlinear feedback system and of its inverse are con- 
sidered in detail. First, however, a fundamental lemma is proven. 

Lemma 7.1 

Let F be an operator from W 1 into W % and assume that F admits a 
linearization at every x e W^ Let L x denote this linearization. Then 
\\L X || is uniformly bounded in ^(W^W^) if and only if F is Lipschitz 
continuous on W x . In fact, F\\ A = sup^^ \\L X J\. 

Proof: It will be shown that \\F\\ A = sup^. eWi \\L X ||, where the equality 
means that if one side exists, so does the other, and they are equal. 
The inequality \\L X || < \\F\\ A follows immediately from the definition 
of linearization and yields sup^ 6>Fi \\L X \\ < ||jF|| A . To prove the converse 
inequality, let x,y e W x be given and consider the equality 

dF , 



Fx — Fy 



■f 



da. 



(x — <x.(y — x)) di 



-r 



L x ^ iy _ x) (x - y) da.. 
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This yields ||F* - Fy\\ < sup ae[01] !!£„<,-„, || \\x - yh and thus 

IIF1L A sup "^"^"^ < sup ||LJ|. 

x.yeWi \\X — y\\ Wl ^Wi 

Hence sup^^ \\L Xq \\ = \\F\\ A as claimed. 

Consider now the feedback system studied in Chapter 4 that is 
described by the functional equations 



e x = u x - y %9 
e* — w 2 + y l9 
y± = Gx*i> 



(FE) 



72 



G 9 e» 



and satisfies the conditions W.1-W.4, G.1-G.4, and LI enumerated in 
Chapter 4. The time-interval of definition is S — [T 0i co), and well- 
posedness imposes weak conditions on the operators G x and G 2 . One 
such condition has been given in Theorem 4.1. 

Assume that for some elements x x e W u and x z e W 2e the operators 
G x and G % admit linearizations L lxi and L 2a . 2 at, respectively, x 1 and x 2 . 
Consider now the functional equations associated with this linearization 
given by 



e± = Ul - y 2 , 
e' % = u' 2 + y' l9 

y[ = L ixA> 



(LFE) 



y% 



J 2a;a c 2* 



It follows immediately from Theoreifi 7.2 that these equations satisfy 
the assumptions W.1-W.4, G.1-G.4, and LI if u\ e W iey i = 1, 2. This 
system obviously describes a feedback system that will be called the 
linearized feedback system. Note that this linearized feedback system is 
linear but that the operators in the forward and the feedback loop 
depend on the point of linearization, (x 1$ x 2 ) e W le x W 2e . 

Theorem 7.5 exposes the relationship between continuity of a feedback 
system and its linearization. 2 

2 Theorem 7.5 is an infinite-dimensional version of a theorem of Palais (Refs. 2, 3) 
which essentially states that a nonlinear transformation on a finite-dimensional 
space is invertible if all its linearizations are invertible. Theorem 7.5 is by no means a 
generalization of this theorem to infinite-dimensional spaces, since it exploits 
causality and invertibility on extended spaces in a very essential manner. 
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Theorem 7.5 

Consider the feedback system described by the functional equations 
(FE) and assume that it is well posed. Assume that the operator G i9 
i = 1,2, admits a linearization at every x 4 e W ie . Then the (in general 
nonlinear) feedback system described by the functional equations (FE) 
is Lipschitz continuous if and only if the linearized feedback system 
described by the functional equations (LFE) is Lipschitz continuous for 
all (x u x z ) e W le x W 2e , uniformly in (x^x^). 

Proof: It can be shown that well-posedness of (FE) implies well- 
posedness of (LFE). However, since the thrust of Theorem 7.5 lies in 
the continuity implications and since the demonstration of the above 
claim is rather involved, its proof will be deleted. Turning now to the 
continuity proof, let G denote the operator defined on W le x W 2e by 
G(e u e^ = (-G^G^). Clearly, L(e u e 2 ) = (-L x e 2 ,L x e^ is the 
linearization of G at (* 1? x 2 ). By Theorem 7.4, (/ + G)~ x (which exists 
and is causal by well-posedness) admits the linearization (/ + L)~ x at 
(x l3 x 2 ). Thus by Lemma 7.1, 



||P r (/+G)- 1 P T IU= sup \\P T (I + L)-*P T \\ 



for all TeS. 



Now passing to the limit with T-> oo shows that these limits exist 
simultaneously — since both sides are monotone in T — and that they 
are equal if they exist. This is precisely the claim of the theorem. 

Theorem 7.5 thus shows that for Lipschitz continuity it suffices to 
consider the linearized system. The following theorem relates stability 
of the nonlinear system with the stability of its linearization at the 
origin. The resulting theorem is consequently a great deal weaker. 

Theorem 7.6 

Consider the feedback system described by the functional equations 
(FE) and assume that it is well posed. Assume that the operator G u 
i = 1, 2, admits a linearization at e W ie . Then the (in general non- 
linear) feedback system described by the functional equations (FE) is 
not finite-gain stable (and thus not Lipschitz continuous) if the feedback 
system linearized at the origin (i.e. , described by the functional equations 
(LFE) with (x x ,x 2 ) = 0) is not continuous. 

Proof: Let G denote the operator as defined in the proof of Theorem 
7.5 and let L be its linearization at the origin. Let u = (w l3 w 2 ) eW x x 
W 2 be the input to both the linearized and the nonlinear feedback 



TOTAL AND INCREMENTAL GAIN LINEARIZATION 169 

system. Assume now that the nonlinear feedback system is finite-gain 
stable; i.e., ||(/+ G)"! < oo. Since (J + L )~ l is the linearization of 
(/ + G)- 1 at the origin, it follows that for all TeS, 



= lim 

a-+0 



\\p T [(i + gt 1 ™- (i + ur 1 ^] 



> 



limi 

a-»0 \ 



l«l IN 

\\P T (I + J^rHiH ||P T (7 + G^at/ll 



|oc| ||«|| 



Mi 

Hence \\P T (I + L^uW < ||(/+ G)^ ||w||, which yields stability of 
the linearized system and the proof of the theorem. 

The ideas of Theorems 7.5 and 7.6 can be combined to show that a 
well-posed feedback system is continuous if and only if it is stable and 
its linearization at any point is continuous. This thus relaxes the 
uniformity requirement of Theorem 7.5, but introduces the explicit 
requirement that the feedback system be stable. Since this latter 
condition cannot be stated in terms of linearizations, the combined 
theorems cannot provide a test for verifying continuity by solely 
considering linearizations, and so the combination has not been stated 
explicitly in this section. 



7,3 The Describing Function, the Total Gain Linearization, 
and the Incremental Gain Linearization 

In the previous chapters, a number of stability criteria for nonlinear 
feedback systems have been derived. The question of whether or not 
these criteria are conservative cannot be given a general answer, but 
both from the estimates and from examples one suspects that these 
criteria are by no means necessary and sufficient (Ref. 4). Thus the 
question arises whether these criteria are inadequate or too conservative, 
and if instability and stability conditions can be derived using some 
approximate methods. 

There have indeed been a number of such attempts in the engineering 
literature. The best known of these approximate methods are the 
describing function (and its generalizations), the total gain linearization 
based on Aizerman's conjecture, and the incremental gain linearization. 
All these methods rest in essence on a common technical (but not 
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mathematical) principle. In particular* the describing function has been 
very extensively — and, it appears, successfully — applied to engineer- 
ing problems. 

There is one class of feedback systems for which necessary and 
sufficient conditions for stability are known, namely, the Nyquist 
criterion, which applies to feedback systems with a linear, time 
invariant system in the forward loop and a constant feedback gain. 
Thus, by associating with a nonlinear feedback system a well-chosen 
class of linear time invariant feedback systems, one tries to conclude 
stability or instability. 

The following section takes a critical look at some of these lineariza- 
tion procedures and exposes, by means of an example, unexpected 
periodic solutions in a class of nonlinear feedback systems. Although 
the system chosen to obtain this conclusion might seem quite special, 
the method of analysis remains applicable to other systems and will 
expose an essentially similar behavior. The example could also suggest 
to what extent and for which systems the existing frequency-domain 
stability criteria can be improved. They also demonstrate the need for 
caution in applying these linearization techniques in stability analysis. 

Consider the feedback system shown in Figure 7.1. The relation 



+ 



™-% 



Initial conditions 
y(t) 



R^T 



f(-) 



Figure 7.1 Feedback System 



between the input and the output of the element in the forward loop is 
determined by the ordinary time-invariant differential equation 

x(t) = Ax{t) + bu(t), 

y(t) = c'x(t), 

where A is a constant (n x n) matrix, and b and c are constant «- 
vectors. The transfer function of this system is thus given by G(s) — 
c'{Is — Ay^b and is the ratio of two polynomials in s with the degree 
of the numerator less than the degree of the denominator. The element 
/(■) in the feedback loop generates an output/(<r) when its input is <r, 
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and / is a Lipschitz-continuous mapping from the real line into itself. 
The differential equation describing the closed-loop system is thus 

x(t) = Ax(t) - bf{c'x{t)\ 

It is assumed that/(0) = 0. The solution x(t) == is called the null 
solution of this system and is said to be asymptotically stable in the large 
if it is stable (in the sense of Lyapunov) and if all solutions converge to 
the null solution for t -> oo. For convenience the feedback system under 
consideration is said to be asymptotically stable in the large if the null 
solution is. 

Rather than investigating input-output stability as in the remainder 
of this monograph, attention is focused in this section on asymptotic 
stability, in order that the reader may more easily compare the literature 
on these linearization methods. It can actually be shown, using the 
methods employed in Section 5.3, that Z^-stability, with 1 </? < oo, 
of the feedback system under consideration implies asymptotic stability 
in the large. 

For the case for which f(a) = Ka y this stability problem can be 
completely resolved using root-locus techniques, the Nyquist stability 
criterion or a Routh-Hurwitz test (Ref. 5) and thus presents in principle 
no difficulties. If/(o*) is nonlinear, however, this is not so, and often in 
engineering practice the question whether a particular feedback 
system of this type is asymptotically stable in the large is answered by 
considering a linearized model. 

Three common types of linearization are the dc type of linearization, 
the ac type of linearization, and the describing-function type of 
linearization. These are now formally defined. 

Definitions: Let /be a mapping from the real line into itself with 
7X0) — 0. The dc gain or the total gain of the nonlinearity /(or) at 
a (a j* 0) is defined by K t (<f) = /{<*)!<*• If/ is differentiate, then the 
regain or the incremental gain of the nonlinearity/(a) at tf is defined by 
K^a) = df{a)jda. If/satisfies for some M l9 M 2 the inequality |/(cr)| < 
M x + M % \a\ for all cr, then the describing-function gain (or the 
equivalent gain) of the nonlinearity /(a) at amplitude A(A ^ 0) is the 
complex number K d (A) defined by 



K d (A) 



ttA Jo 



f(A cos t)e jt dt. 



Clearly, K d (A) = {Ij-rrA) ffif{A cos./) cos / dt, where / is a single- 
valued function. The describing function can also be used when /is a 
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hysteresis loop or a similar multiple-valued memory-type nonlinearity. 
In that case the equivalent gain would in general be a complex number. 
One also often requires the nonlinearity to be odd; this is an attempt to 
eliminate the dc term in the output with a sinusoidal input. It should 
also be noted that the describing-function method has been extended so 
that it allows for more than simple sinusoidal inputs and that it is 
capable of treating uncertainty as well. 3 

The procedure by which linearization attempts to conclude stability 
for the dc, the ac, and the equivalent-gain types of linearization goes 
as follows: If the linear system with/(cr) — Ka is asymptotically stable 
for all K in the range of the dc gain, the ac gain, or the equivalent gain 
(i.e., for all K = K t (a), K « K^a), or K = K d (A) and all a or A) then 
the nonlinear system is asymptotically stable in the large. 

Both the dc type and the ac type of linearization and the resulting 
conclusions about stability have been the subject of rather well-known 
conjectures, due, respectively, to Aizerman (Ref. 7) and Kalman 
(Ref. 8). Particularly the Aizerman conjecture has received a lot of 
attention. 4 

The intuitive reasoning behind these conjectures in rather clear: in 
the total gain linearization (Aizerman's conjecture), one realizes that 
asymptotic stability is a property pertaining to the origin, and thus one 
replaces the nonlinearity by a linear gain that, viewed from the origin, 
gives instantaneously the same gain. It is clear that if stability results 
for all time-varying gains in the range of the dc gain , then the feedback 
system is indeed asymptotically stable in the large. However, there does 
not seem to be any reason to expect that the above procedure yields 
correct results. Indeed, it sometimes fails. For incremental gain 
linearization, one views the local properties of the nonlinearity and 
replaces the nonlinearity by a linear gain which is locally the same. One 

3 An excellent source for the describing function and its generalizations is Reference 
6. 

4 Originally published in 1949, it took until 1958 before Pliss (Ref. 9) gave a 
satisfactory counterexample. It is possible to show using the Popov criterion that 
for second-order systems the conjecture is true with the exception of some cases 
where the dc gain approaches for large values of its argument a gain for which the 
resulting linear system is not asymptotically stable. The counterexample given by 
Krasovskii (Ref. 10) is in fact of this kind. The counterexamples obtained by Pliss, 
however, are more satisfactory. The very stringent conditions on the nonlinearity 
and the involved mathematics kept the work of Pliss from being well known. More 
recently, Dewey and Jury (Ref. 11) and Fitts (Ref. 12) gave numerical counter- 
examples derived from a computer simulation. The conjecture due to Kalman in 
which the ac gain is used predicts stability in the large only for a subclass of the 
nonlinearities for which Aizerman's conjecture does. Fitts (Ref. 12) gives counter- 
examples to this conjecture derived from a computer analysis. 
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ignores in this process the fact that this linear gain does not extend to 
the origin. The incremental gain does correspond to a type of lineariza- 
tion: namely those linearizations obtained by considering constant 
inputs. The linearization thus suggests a continuity consideration, but 
the conclusion pertains to stability. Note that the range of the dc gain 
is always smaller than the range of the ac gain and thus that the ac gain 
type of linearization leads to more cautious conclusions than the dc 
gain type of linearization. 

The question of continuity of a particular feedback system can be 
answered by linearization, as indicated in Section 7.2. However, one 
needs thereby to replace the nonlinearity / by a gain k(t), which varies 
between a < h(t) < ft with a = inf aei2 K^a) and fi = sup ael2 ^(c), 
and the resulting linear system needs to be stable for any gain satisfying 
the constraint a < k(t) < /?. 

The describing-function technique is clearly inspired by linear 
time-invariant systems where the eigenfunctions (in L^) are sinusoids, 
and thus the responses to sinusoids are completely representative of the 
system behavior. This, however, does not carry over to nonlinear 
systems and makes the mathematical philosophy of the describing 
function unclear. However, it appears to yield good results, which is 
understandable in view of the fact that if a system sustains oscillations, 
then it is very likely that the first harmonics are in balance somewhere 
in the neighborhood of the oscillation. 

In what follows, a simple, rigorous proof of the existence of periodic 
solutions in a fourth-order system will be given. It will be shown, 
however, that all of the above-mentioned linearization techniques 
predict asymptotic stability in the large. These oscillations thus con- 
stitute a class of counterexamples to both Aizerman's conjecture and 
Kalman's suggestion. The results are obtained using perturbation 
theory (Refs. 13, 14). Since the ideas behind this technique are simple, 
the theorem from which the results follow will be proved. 



7.4 Averaging Theory 

Consider the ordinary differential equation 

x(t) = Ax{t) + €f(x(t) 9 z 9 c), 

where x(t) is an element of R n , A is a constant (n x n) matrix, z is a 
parameter (an element of jR m ), e is a scalar parameter, and/is a mapping 
from R n x R m x R into R n such that for all R, e , and M there exists 
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a constant K(R 9 e 09 M) (the Lipschitz constant) such that \\f(x l9 z 3 e) — 
f(x %9 zM <K\\x x - xjforalllixj, l|x 2 ll < *,M < e and|lz|| < M. 
Since the function /does not satisfy a global Lipschitz condition, it 
is not clear at this point whether a solution x{t) to the differential 
equation exists for all /. This problem is resolved in the next lemma. 

Definition: Let x(t) be a continuous map from [0,jT] into a Banach 
space. Then sup, et0(r] ||*(0II exists and is called the norm induced by the 
uniform topology. This space is complete. Recall that the contraction 
mapping principle states that if F is a map from a complete metric 
space X into itself with d(F(x) 9 F(y)) < (*.d(x 9 y) for all x 9 y eX and 
some a < 1 , then the equation x = Fx has a unique solution (called a 
fixed point of the mapping F). Moreover, picking any x Q and defining 
x&+i & Fx k> k e !*> y ields a sequence {x k } which converges in the metric 
of X to the fixed point of F. 



Lemma 7.2 

Given any r > 0, />, and M 9 then the above differential equation 
has a unique solution x(t) for any x(0),z,and t that satisfy ||x(0)|| < p 9 
< t < t, and ||z|| < M provided e is sufficiently small (i.e., for all e 
with \c\ < €j and some e x > 0). Moreover, this solution can be obtained 
using the successive approximations 



x Q (t) = e M x(0) 



and 



W) - e^<x(0) + € J V^/C^cr) A €) dtf 



forJfee/+ 

Proof: Let S be the sphere in the Banach space of all continuous 
mappings from [0,t] into R n with the uniform topology and with ||x(f)ll 
< 2pN where N ~ sup ^^ r l|e^«||. The mapping F defined on S by 

Fx(f) = e^c(0) + Se Mt - a) f{x(a) 9 z 9 e) da, 

maps S into itself for all |e| < e x with 

€l < min {€„, W)" 1 , pr+HpNK + 11/(0,0,0) ||)-i}, 

where K is the Lipschitz constant associated with R = 2pJV, € > 0, 
and M. Moreover, F is a contraction on S. The verification of these 
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facts is simple and will not be given explicitly. Thus, the equation 
x(t) = Fx(t) has a unique fixed point, which can be obtained using the 
successive approximations as stated. This yields the lemma. 

The next lemma exposes the dependence of x(t) on e more explicitly: 

Lemma 7.3 

Given any r > 0, p 9 and M, then the solution x(t) to the above differ- 
ential equation for any x(0) 9 z, and / that satisfy ||x(0)|| < />, < t < r, 
and ||z|| < M can be expressed as 

x(0 = e At x(fy + e f V (i - ff) /(e^ ff x(0),z, € ) da + € 2 L(*,x(0),z, e ) 

for all e sufficiently small (i.e., for all € with |e| < e 2 and some e 2 > 0). 
Moreover, L(?,x(0),z,e) is bounded for < t < t, ||x(0)|| < p 9 
\\z\\ < M, and | € | < e 2 . 

JVoo/V It will be shown that the (k + l)th element in the series of 
successive approximations introduced above is of this form provided 
the kth one is, and that the bound on L k can be taken to be independent 
of jfc. Since x x (0 is clearly of that form, the result follows by induction, 
because the limit for k -► oo exists and must also be of this form. Let K 
be the Lipschitz constant associated with 2pN 9 e 1? and M, and let 
€ 2 < min {€ l9 (2Nr)~ 1 }. A simple calculation then shows that IIZ^ill < 
rW(||/(0,0,0)|i + KNp) if \\L k \\ < rW 2 (||/(0,0,0)|| + KNp) 9 which 
then, in view of the above remarks, yields the lemma. 

Lemma 7.3 yields the following theorem on the existence of periodic 
solutions to the differential equation under consideration. 

Theorem 7.7 

If for € sufficiently small (i.e., for all e with |e] < «r and some e > 0) 
there exist bounded functions x(0)(e), T(e), and z(e) such that 



r*r<e) 



X (0)( e ) = e^ w x(0X«) + *J o e Al ™- al f(e A ™x(0)( e ),z(*),e) da 

+ € 2 L(r(<0,x(O)(e),z(e),€), 

then the differential equation under consideration has a periodic 
solution for e sufficiently small (i.e., for e with [e| < e 1 and some 
«i > 0). 



■p* 1 



176 



LINEARIZATION AND STABILITY 



Proof: Lemma 7.2 shows that for e sufficiently small x(r(e)) = x(0)(€) 9 
which, since the differential equation under consideration is time- 
invariant, yields a periodic solution of period T(e). 

Theorem 7.7 is not very useful as it stands, since it requires computa- 
tion of the function L and solving for the functions x(0)(€), r(e) and 
z(e). However, by using the implicit function theorem it is possible to 
obtain sufficient conditions for the conditions of Theorem 7.7 to be 
satisfied. 

In the theorem which follows, use will be made of the implicit 
function theorem (Ref. 1, p. 47), which states that iff maps R n x R m 
into R n and if: 

*• /CWo) — ° for some *o e R n> yo e R m> 

2. Q(x,y) exists and is continuous in a neighborhood of the point 

x 0i 7 , and 
3 - fjCWo)is of rank n 9 

then there exists a map, <f>, from R n into R m , which is continuous in a 
neighborhood of x Q and such that y = </>(x) yields f(<f>(x) 9 x) = for 
all x in some neighborhood of x Q . Moreover, y = <j>(x ) 9 and <j> is 
unique in a neighborhood of x . 

Theorem 7.8 

Assume that e AT « = / (i.e., that all solutions of x(t) — Ax{i) are 
periodic with period T ), and that/(x,z,e) is a continuous function of 
x, z 9 and e that has continuous first partial derivatives with respect to x 
and z for e sufficiently small (i.e., for all e with |e| < e and some 
* > 0.) Let 



F(x,z 9 c) A f °e~ A y(e A(T x,z 9 €) da 



and assume: 1, that Kxo^O) === 0; and 2, that the matrix 
~^ z (xq,z ,0) is of full rank. Then there exists a continuous function 
z(e) such that for e sufficiently small (i.e., for all e with \e\ < e x and 
some e x > 0) the differential equation under consideration has a 
periodic solution x*(t,e) with lim € _ z(e) = z and lim € _ x*(/,e) = 
e At x Q . 

Proof: The smoothness conditions on / together with the resulting 
smoothness of the solutions of ordinary differential equations (Ref. 
15, p. 29) ensure that the implicit function theorem is applicable. This 
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in turn shows that the conditions 1 and 2 of the theorem are sufficient 
to ensure that Theorem 7.7 is applicable, which leads to the conclusion 
of the theorem. 

This method of concluding the existence of periodic solutions for 
differential equations is known as Averaging Theory since the function 
F as defined in Theorem 7.8 requires the average value of the velocity 
vector along the solution e At XQ to be zero. 

7.5 Counterexamples to Aizerman's Conjecture 

Consider the differential equation 
x< 4 >(0 + 10jc< 2) (0 4- 9x{t) + €[ox<«(0 + p X W(t) 

+ y*<»(0 + dx(t)] + €/(*<«(*)) = 0, 
where / maps R into R and is continuously differentiable with respect 
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Figure 7.2 The Fourth-Order System to which Averaging Theory is Applied 
and which Yields Counterexamples to Aizerman's Conjecture 

to its argument. This equation describes the feedback system shown in 
Figure 7.2 and is equivalent to the following system of first-order 
differential equations : 
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where 0(e 2 ) denotes a four-dimensional vector which is such that 
lim e _x) [0(e 2 )/e] = 0. The application of Theorem 7.8 shows that there 
exist continuous functions a(e), /?(e), y(e), and d{d) such that the 
differential equation under consideration has a periodic solution, 
z*(/,€), with lim e ^ a(e), £0), y(e), d(e) = a , /? , y , <3 and 



lim z*(t 9 e) = e Ai 

€->0 



Z 2,0 

Z 3,0 

- Z 4,0- 






1 








1 




















3 








-3 


0. 



1 f 2 * 

(1) (yo " «o)zi.o + (A> ~ ^2,o *k ~ f(z lt0 cos a 

IT JO 

+ z 2i0 sin cr + z 3>0 cos 3a + z 4>0 sin 3a) sin a da = 0, 
1 f 2 * 

(A) — <* )z lt0 — (y — «0>8.0 + ~ /(Zi.o COS (T 

7T JO 

+ z 2j0 sin a + z 3t0 cos 3a + z 40 sin 3a) cos a do" = 0, 

1 f 2 * 
(yo/3 - 3a )z 3j0 + (fi - ^ /9)z 4)0 + - f(z lt0 cos cr 

7T JO 

+ z 2i0 sin a + z 3i0 cos 3a + z 40 sin 3a) sin 3a da = 0, 

i r 2,T 

(fl> — <5 /9)z 3t0 - (y /3 - 3oo)z 4f0 + - /(z 1>0 cos a 

77 Jo 

+ z 20 sin a + z 3i0 cos 3cr + z 40 sin 3a) cos 3a da = 0, 
and 

(2) (zj.0 + zl.^5.0 + 4o) ^ 0. 

Equation (2) guarantees that the matrix in Theorem 7.8 is of full rank, 
and equation (1) is derived from the requirement that the average be 
zero. 

From these conditions the following theorem, which will be central 
in establishing the counterexamples to Aizerman's conjecture is now 
derived. 

Theorem 7.9 

Iff (a) is not identically equal to ka for any constant k, then there 
exists a nonzero periodic solution to the differential equation under 
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consideration for e sufficiently small (i.e., for all e with ]e| < € and 
€ > 0), and proper choice of the functions a(e) 3 f}(e), y(e), and <5(e). 
Moreover, the functions a(e) and y(e) which yield this periodic solution 
satisfy the inequality 

(y(c) - a(e))(y(€) - 9a(«)) < 0. 

Proof: The proof proceeds as follows: First, a , /? , y 0? and <5 will be 
chosen in such a way that the determining equations are satisfied. If 
*i.o> z 2,o, Zs,o> and z 4i0 are chosen such that {z\ + 4 f oX 2 S f o + 4o) ^ °> 
then the determining equations can be solved uniquely for a , fi , y , 
and c3 , since the determinant of these linear equations in a , /?„, y , and 
<5 is then nonzero and yields in particular the following expression for 
yo - <V 

1 1 f 2ff 
yo - <*o = -J i~ - /( z i,o cos a + z 2i0 sin a 

Z l,0 + Z 2,0 ^ ^° 

+ z 3(0 cos 3a + z 4t0 sin 3a)(z 1(0 sin a — z 20 cos a) da. 

Iff is linear, then y — a = 0. Iff is not linear, then there exists a 
choice of z lt0 , z 2j0 , z 3f0 , and z 4i0 with (z* + z| (0 )(zg (0 + zf ) 5* such 
that y — «o 7^ 0. Moreover, from these determining equations it also 
follows that 

(yo - «o)0£o + z 2,o) + (yo - 9a )(z| j0 + z|,o) = 0. 

Thus (y — a )(y — 9a ) < for this choice of z lj0 , z 2i0 , z 3i0 , and z 40 
with (z 2 + z\ )(z 2 + z 2 ) 5^ 0, which yields the conclusion of the 
theorem. 

Consider now the zeros of the polynomial 

s* + 10s 2 + 9 + e(as 3 + fis* + ys + S) + £J& 2 . 
For e sufficiently small and for K bounded, these zeros lie: 

1. in Re s < if e > 0, a > 0, y > 0, and (7 - a)(y - 9a) < 0, 

or if e < 0, a < 0, y < 0, and (y - oc)(y - 9a) < 0; 

2. in Re s > if e < 0, a > 0, y > 0, and (y - a)(y - 9a) < 0, 

or if £ > 0, a < 0, y < 0, and (y - <x)(y - 9a) < 0. 

Thus all the linearization techniques would predict that the feedback 
system under consideration is asymptotically stable in the large provided 
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that € > 0, a > 0, y > and (y - <x)(y - 9a)< or that € < 0, 
a < 0, y < and (y — a)(y — 9a) < and the linearized gain satisfies 
< K < 1 . These regions are graphically shown in Figure 7.3. 
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Figure 7.3 Conditions on e, a, y to Obtain Counterexamples to Aizerman's 
Conjecture 



Choosing now a nonlinearity such that K d (a), K^a), and K d (A) 
satisfy < K < 1 (e.g., f(a) = tanh a), it is clear that for e 
sufficiently small and for values of a and y such that (y — oc)(y — 9a) < 
0, the sign of e can be chosen in such a way that the linearization 
techniques would predict the feedback system under consideration to 
asymptotically stable in the large. This, however, is in direct contra- 
diction with Theorem 7.9, which shows that the feedback system 
sustains a periodic solution. 

Remark 1: The choice of the function f(a) = tanh or, is irrelevant. 
In fact, the same conclusion holds for any nonlinearity, provided 
it is sufficiently smooth for Theorem 7.9 to be applicable and pro- 
vided |/(a)| < K\a\ for some K and all a which then yields, for e 
sufficiently small, the pole locations of the linearized system as given. 

Remark 2: The remarkable feature of the periodic solutions discovered 
in Theorem 7,9 is that (for € sufficiently small) they occur only when 
the linearized system has all its poles either always in the left half-plane 
or always in the right half-plane, contrary to what is to be expected 
from linearization. 

Remark 3; The Nyquist locus and the root locus of the fourth-order 
system under consideration are shown in Figure 7.4 for the case 
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Figure 7.4 Nyquist Locus of G(ja>) and Root Locus of the Linearized 
Feedback System 



6 > 0, a > 0, y > 0, 

y < 0, and (y — a)(y - 



and (y — <x){y - 9a) > or € < 0, a < 0, 
- 9a) < 0. 



Remark 4; The local stability properties of these periodic solutions is 
of course of interest. Variational techniques show that for proper 
choices of a, /?, y, a, e 5 and/(*) these periodic solutions can be locally 
stable. 



The existence of the periodic solutions discovered in this section will 
now be given an intuitive explanation. 5 This will of course be a plausi- 
bility argument. Averaging theory allows us to conclude that the 
argument is correct provided e is sufficiently small. 

Assume an input to the nonlinearity e/(-) which has a first harmonic, 
a third harmonic, and "small" other harmonics. The output to the 
nonlinearity will thus contain all harmonics, all of comparable magni- 
tudes, and all "small" since they have been multiplied by a small 
parameter e. Let x u x 3 , y u and j 3 be the Fourier coefficients of the first 
and the third harmonics of the input and the output to the nonlinearity. 
It can be shown that for particular choices of x 1 and y x the nonlinearity 
will shift the phases of the first and third harmonics toward one another, 

5 The dual-input describing function would predict these oscillations. However, 
the method of proof will yield counterexamples to the dual-input describing function 
when applied to higher-order systems. 
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Figure 7.5 The Frequency Spectrum of the Input and the Output of the 
Element in the Feedback Loop 



thus obtaining the situation depicted in Figure 7.5. The negative 
feedback leads to an input u to the forward loop — as is shown in 
Figure 7.6 — that with a Nyquist locus as in Figure 7.6 multiplies the 
first and third harmonic by a factor of order e" 1 and shifts their phases 
in the right direction but by an amount less than 180°, thus restoring 
the original relationship of x x and x z . The higher harmonics remain of 
order €. The loop can thus be closed and the feedback system sustains 
the oscillation. 




Figure 7.6 The Frequency Spectrum of the Input and the Output of the 
Element in the Forward Loop 
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